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Finite Volume Methods for Hyperbolic Problems 



This book contains an introduction to hyperbolic partial differential equations and a pow- 
erful class of numerical methods for approximating their solution, including both linear 
problems and nonlinear conservation laws. These equations describe a wide range of wave- 
propagation and transport phenomena arising in nearly every scientific and engineering 
discipline. Several applications are described in a self-contained manner, along with much 
of the mathematical theory of hyperbolic problems. High-resolution versions of Godunov’s 
method are developed, in which Riemann problems are solved to determine the local wave 
structure and limiters are then applied to eliminate numerical oscillations. These meth- 
ods were originally designed to capture shock waves accurately, but are also useful tools 
for studying linear wave-propagation problems, particularly in heterogenous material. The 
methods studied are implemented in the CLAWPACK software package. Source code for all 
the examples presented can be found on the web, along with animations of many time- 
dependent solutions. This provides an excellent learning environment for understanding 
wave-propagation phenomena and finite volume methods. 

Randall LeVeque is the Boeing Professor of Applied Mathematics at the University of 
Washington. 
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Preface 



Hyperbolic partial differential equations arise in a broad spectrum of disciplines where 
wave motion or advective transport is important: gas dynamics, acoustics, elastodynamics, 
optics, geophysics, and biomechanics, to name but a few. This book is intended to serve 
as an introduction to both the theory and the practical use of high-resolution finite volume 
methods for hyperbolic problems. These methods have proved to be extremely useful in 
modeling a broad set of phenomena, and I believe that there is need for a book introducing 
them in a general framework that is accessible to students and researchers in many different 
disciplines. 

Historically, many of the fundamental ideas were first developed for the special case 
of compressible gas dynamics (the Euler equations), for applications in aerodynamics, 
astrophysics, detonation waves, and related fields where shock waves arise. The study of 
simpler equations such as the advection equation. Burgers’ equation, and the shallow water 
equations has played an important role in the development of these methods, but often only as 
model problems, the ultimate goal being application to the Euler equations. This orientation 
is still reflected in many of the texts on these methods. Of course the Euler equations remain 
an extremely important application, and are presented and studied in this book, but there 
are also many other applications where challenging problems can be successfully tackled 
by understanding the basic ideas of high-resolution finite volume methods. Often it is not 
necessary to understand the Euler equations in order to do so, and the complexity and 
peculiarities of this particular system may obscure the more basic ideas. 

In particular, the Euler equations are nonlinear. This nonlinearity, and the consequent 
shock formation seen in solutions, leads to many of the computational challenges that moti- 
vated the development of these methods. The mathematical theory of nonlinear hyperbolic 
problems is also quite beautiful, and the development and analysis of finite volume methods 
requires a rich interplay between this mathematical theory, physical modeling, and numer- 
ical analysis. As a result it is a challenging and satisfying field of study, and much of this 
book focuses on nonlinear problems. 

However, all of Part I and much of Part III (on multidimensional problems) deals en- 
tirely with linear hyperbolic systems. This is partly because many of the concepts can 
be introduced and understood most easily in the linear case. A thorough understanding 
of linear hyperbolic theory, and the development of high-resolution methods in the linear 
case, is extremely useful in fully understanding the nonlinear case. In addition, I believe 
there are many linear wave-propagation problems (e.g., in acoustics, elastodynamics, or 
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electromagnetics) where these methods have a great deal of potential that has not been fully 
exploited, particularly for problems in heterogeneous media. 1 hope to encourage students 
to explore some of these areas, and researchers in these areas to learn about finite vol- 
ume methods. 1 have tried to make it possible to do so without delving into the additional 
complications of the nonlinear theory. 

Studying these methods in the context of a broader set of applications has other pedagog- 
ical advantages as well. Identifying the common features of various problems (as unified 
by the hyperbolic theory) often leads to a better understanding of this theory and greater 
ability to apply these techniques later to new problems. The finite volume approach can 
itself lead to greater insight into the physical phenomena and mathematical techniques. The 
derivation of most conservation laws gives first an integral formulation that is then converted 
to a differential equation. A finite volume method is based on the integral formulation, and 
hence is often closer to the physics than is the partial differential equation. 

Mastering a set of numerical methods in conjunction with learning the related mathemat- 
ics and physics has a further advantage: it is possible to apply the methods immediately in 
order to observe the behavior of solutions to the equations, and thereby gain intuition for 
how these solutions behave. To facilitate this hands-on approach to learning, virtually every 
example in the book (and many examples not in the book) can be solved by the reader using 
programs and data that are easy to download from the web. The basis for most of these pro- 
grams is the CLAWPACK software package, which stands for “conservation-law-package.” 
This package was originally developed for my own use in teaching and so is intimately 
linked with the methods studied in this book. By having access to the source code used 
to generate each figure, it is possible for the interested reader to delve more deeply into 
implementation details that aren’t presented in the text. Animations of many of the figures 
are also available on the webpages, making it easier to visualize the time-dependent nature 
of these solutions. By downloading and modifying the code, it is also possible to experiment 
with different initial or boundary conditions, with different mesh sizes or other parameters, 
or with different methods on the same problem. 

CLAWPACK has been freely available for several years and is now extensively used for 
research as well as teaching purposes. Another function of this book is to serve as a reference 
to users of the software who desire a better understanding of the methods employed and the 
ways in which these methods can be adapted to new applications. The book is not, however, 
designed to be a user’s manual for the package, and it is not necessary to do any computing 
in order to follow the presentation. 

There are many different approaches to developing and implementing high-resolution 
finite volume methods for hyperbolic equations. In this book I concentrate primarily on one 
particular approach, the wave-propagation algorithm that is implemented in CLAWPACK, 
but numerous other methods and the relation between them are discussed at least briefly. 
It would be impossible to survey all such methods in any detail, and instead my aim is 
to provide enough understanding of the underlying ideas that the reader will have a good 
basis for learning about other methods from the literature. With minor modifications of the 
CLAWPACK code it is possible to implement many different methods and easily compare 
them on the same set of problems. 

This book is the result of an evolving set of lecture notes that I have used in teaching 
this material over the past 15 years. An early version was published in 1989 after giving 
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the course at ETH in Zurich [281]. That version has proved popular among instructors and 
students, perhaps primarily because it is short and concise. Unfortunately, the same claim 
cannot be made for the present book. I have tried, however, to write the book in such a 
way that self-contained subsets can be extracted for teaching (and learning) this material. 
The latter part of many chapters gets into more esoteric material that may be useful to have 
available for reference but is not required reading. In addition, many whole chapters can be 
omitted without loss of continuity in a course that stresses certain aspects of the material. In 
particular, to focus on linear hyperbolic problems and heterogeneous media, a suggested set 
of chapters might be 1-9 and 18-21, omitting the sections in the multidimensional chapters 
that deal with nonlinearity. Other chapters may also be of interest, but can be omitted 
without loss of continuity. To focus on nonlinear conservation laws, the basic theory can 
be found in Chapters 1-8, 11-15, and 18-21. Again, other topics can also be covered if 
time permits, or the course can be shortened further by concentrating on scalar equations 
or one-dimensional problems, for example. 

This book may also be useful in a course on hyperbolic problems where the focus is not 
on numerical methods at all. The mathematical theory in the context of physical applications 
is developed primarily in Chapters 1-3, 9, 11, 13, 14, 16, 18, and 22, chapters that contain 
little discussion of numerical issues. It may still be advantageous to use CLAWPACK to 
further explore these problems and develop physical intuition, but this can be done without 
a detailed study of the numerical methods employed. 

Many topics in this book are closely connected to my own research. Repeatedly teach- 
ing this material, writing course notes, and providing students with sample programs has 
motivated me to search for more general formulations that are easier to explain and more 
broadly applicable. This work has been funded for many years by the National Science 
Foundation, the Department of Energy, and the University of Washington. Without their 
support the present form of this book would not have been possible. 

I am indebted to the many students and colleagues who have taught me so much about 
hyperbolic problems and numerical methods over the years. I cannot begin to thank everyone 
by name, and so will just mention a few people who had a particular impact on what is 
presented in this book. Luigi Quartapelle deserves high honors for carefully reading every 
word of several drafts, finding countless errors, and making numerous suggestions for 
substantial improvement. Special thanks are also due to Mike Epton, Christiane Helzel, Jan 
Olav Langseth, Sorin Mitran, and George Turkiyyah. Along with many others, they helped 
me to avoid a number of blunders and present a more polished manuscript. The remaining 
errors are, of course, my own responsibility. 

I would also like to thank Cambridge University Press for publishing this book at a 
reasonable price, especially since it is intended to be used as a textbook. Many books are 
priced exorbitantly these days, and I believe it is the responsibility of authors to seek out 
and support publishers that serve the community well. 

Most importantly, I would like to thank my family for their constant encouragement and 
support, particularly my wife and son. They have sacrificed many evenings and weekends 
of family time for a project that, from my nine-year old’s perspective at least, has lasted a 
lifetime. 



Seattle, Washington, August, 2001 




Introduction 



Hyperbolic systems of partial differential equations can be used to model a wide variety of 
phenomena that involve wave motion or the advective transport of substances. This chapter 
contains a brief introduction to some of the fundamental concepts and an overview of the 
primary issues discussed in this book. 

The problems we consider are generally time-dependent, so that the solution depends 
on time as well as one or more spatial variables. In one space dimension, a homogeneous 
first-order system of partial differential equations in x and t has the form 

q t (x, t) + Aq x (x, t) = 0 (1.1) 

in the simplest constant-coefficient linear case. Here q : R x R -» R"' is a vector with 
m components representing the unknown functions (pressure, velocity, etc.) we wish to 
determine, and A is a constant m x rn real matrix. In order for this problem to be hyperbolic, 
the matrix must satisfy certain properties discussed below. Note that subscripts are used to 
denote partial derivatives with respect to t and x. 

The simplest case is the constant-coefficient scalar problem, in which m — 1 and the 
matrix A reduces to a scalar value. This problem is hyperbolic provided the scalar A is 
real. Already this simple equation can model either advective transport or wave motion, 
depending on the context. 

Advective transport refers to a substance being carried along with fluid motion. For ex- 
ample, consider a contaminant being advected downstream with some fluid flowing through 
a one-dimensional pipe at constant velocity ii. Then the concentration or density q(x, t) of 
the contaminant satisfies a scalar advection equation of the form 

q t (x, t ) + uq x (x, t) — 0, (1.2) 

as derived in Chapter 2. It is easy to verify that this equation admits solutions of the form 

q{x , t) = q{x — ut) (1.3) 

for any function q ('£,). The concentration profile (or waveform) specified by q simply prop- 
agates with constant speed u and unchanged shape. In this context the equation (1.2) is 
generally called the advection equation. 

The phenomenon of wave motion is observed in its most basic form if we model a sound 
wave traveling down a tube of gas or through an elastic solid. In this case the molecules of 
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the gas or solid barely move, and yet a distinct wave can propagate through the material 
with its shape essentially unchanged over long distances, and at a speed c (the speed of 
sound in the material) that is much larger than the velocity of material particles. We will see 
in Chapter 2 that a sound wave propagating in one direction (to the right with speed c >0) 
can be modeled by the equation 



u> t (x, t) + cw x (x , t) = 0, 



(1-4) 



where w(x, t) is an appropriate combination of the pressure and particle velocity. This again 
has the form of a scalar first-order hyperbolic equation. In this context the equation (1.4) is 
sometimes called the one-way wave equation because it models waves propagating in one 
particular direction. 

Mathematically the advection equation (1.2) and the one-way wave equation (1.4) are 
identical, which suggests that advective transport and wave phenomena can be handled by 
similar mathematical and numerical techniques. 

To model acoustic waves propagating in both directions along a one-dimensional medium, 
we must consider the full acoustic equations derived in Chapter 2, 



p t (x, t) + Ku x (x, t) = 0, 

u t (x , t ) + (1 /p)p x (x, t) = 0 , 



(1-5) 



where p(x, t) is the pressure (or more properly the perturbation from some background 
constant pressure), and u(x. t) is the particle velocity. These are the unknown functions to 
be determined. The material is described by the constants K (the bulk modulus of com- 
pressibility) and p (the density). The system (1.5) can be written as the first-order system 
q, + Aq x = 0, where 



q 




\ = 



0 K 

1/P 0 



( 1 . 6 ) 



To connect this with the one-way wave equation (1.4), let 

w l (x, t ) = p(x, t ) + pcu(x, t ), 



where c = «JK / p. Then it is easy to check that w l (x, t) satisfies the equation 

wj + cw\ = 0 



and so we see that c can be identified as the speed of sound. On the other hand, the function 

w 2 {x , t ) = p(x , t ) — pcu(x , t) 



satisfies the equation 

w ] — civ 2 — 0. 

This is also a one-way wave equation, but with propagation speed — c. This equation has 
solutions of the form q 2 {x, t) — q(x + ct ) and models acoustic waves propagating to the 
left at the speed of sound, rather than to the right. 
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The system (1.5) of two equations can thus be decomposed into two scalar equations 
modeling the two distinct acoustic waves moving in different directions. This is a funda- 
mental theme of hyperbolic equations and crucial to the methods developed in this book. 
We will see that this type of decomposition is possible more generally for hyperbolic sys- 
tems, and in fact the definition of “hyperbolic” is directly connected to this. We say that 
the constant-coefficient system (1.1) is hyperbolic if the matrix A has real eigenvalues and 
a corresponding set of m linearly independent eigenvectors. This means that any vector in 
R" ! can be uniquely decomposed as a linear combination of these eigenvectors. As we will 
see in Chapter 3, this provides the decomposition into distinct waves. The corresponding 
eigenvalues of A give the wave speeds at which each wave propagates. For example, the 
acoustics matrix A of (1.6) has eigenvalues — c and +c, the speeds at which acoustic waves 
can travel in this one-dimensional medium. 

For simple acoustic waves, some readers may be more familiar with the second-order 
wave equation 

p t ,=c 2 p xx . (1.7) 

This equation for the pressure can be obtained from the system (1.5) by differentiating the 
first equation with respect to t and the second with respect to x, and then eliminating the 
u xt terms. The equation (1.7) is also called a hyperbolic equation according to the stan- 
dard classification of second-order linear equations into hyperbolic, parabolic, and elliptic 
equations (see [234], for example). In this book we only consider first-order hyperbolic 
systems as described above. This form is more fundamental physically than the derived 
second-order equation, and is more amenable to the development of high-resolution finite 
volume methods. 

In practical problems there is often a coupling of advective transport and wave motion. 
For example, we will see that the speed of sound in a gas generally depends on the density 
and pressure of the gas. If these properties of the gas vary in space and the gas is flowing, then 
these variations will be advected with the flow. This will have an effect on any sound waves 
propagating through the gas. Moreover, these variations will typically cause acceleration 
of the gas and have a direct effect on the fluid motion itself, which can also be modeled as 
wave-propagation phenomena. This coupling leads to nonlinearity in the equations. 



1.1 Conservation Laws 

Much of this book is concerned with an important class of homogeneous hyperbolic equa- 
tions called conservation laws. The simplest example of a one-dimensional conservation 
law is the partial differential equation (PDE) 

q,(x, t) + f(q(x, t)) x — 0, (1.8) 

where f(q) is the flux function. Rewriting this in the quasilinear form 

q, + f'(q)q x = 0 (1.9) 



suggests that the equation is hyperbolic if the flux Jacobian matrix f'(q) satisfies the con- 
ditions previously given for the matrix A. In fact the linear problem (1.1) is a conservation 
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law with the linear flux function f(q) — Aq. Many physical problems give rise to nonli- 
near conservation laws in which f(q) is a nonlinear function of q, a vector of consen’ed 
quantities. 



1.1.1 Integral Form 

Conservation laws typically arise most naturally from physical laws in an integral form as 
developed in Chapter 2, stating that for any two points X\ and xj, 



d 

dt 




q(x, t ) dx = f(q(x i, 0) - f(q(x 2 , t)). 



( 1 . 10 ) 



Each component of q measures the density of some conserved quantity, and the equation 
(1.10) simply states that the “total mass” of this quantity between any two points can change 
only due to the flux past the endpoints. Such conservation laws naturally hold for many 
fundamental physical quantities. For example, the advection equation (1.2) for the density 
of a contaminant is derived from the fact that the total mass of the contaminant is conserved 
as it flows down the pipe and the flux function is f(q) = uq. If the total mass of contaminant 
is not conserved, because of chemical reactions taking place, for example, then the con- 
servation law must also contain source terms as described in Section 2.5, Chapter 17, and 
elsewhere. 

The constant-coefficient linear acoustics equations (1.5) can be viewed as conservation 
laws for pressure and velocity. Physically, however, these are not conserved quantities 
except approximately in the case of very small amplitude disturbances in uniform media. In 
Section 2.7 the acoustics equations are derived from the Euler equations of gas dynamics, 
the nonlinear conservation laws that model more general disturbances in a compressible gas. 
These equations model the conservation of mass, momentum, and energy, and the laws of 
physics determine the flux functions. See Section 2.6 and Chapter 14 for these derivations. 
These equations have been intensively studied and used in countless computations because 
of their importance in aerodynamics and elsewhere. 

There are many other systems of conservation laws that are important in various appli- 
cations, and several are used in this book as examples. However, the Euler equations play a 
special role in the historical development of the techniques discussed in this book. Much of 
the mathematical theory of nonlinear conservation laws was developed with these equations 
in mind, and many numerical methods were developed specifically for this system. So, al- 
though the theory and methods are applicable much more widely, a good knowledge of the 
Euler equations is required in order to read much of the available literature and benefit from 
these developments. A brief introduction is given in Chapter 14. It is a good idea to become 
familiar with these equations even if your primary interest is far from gas dynamics. 



1.1.2 Discontinuous Solutions 

The differential equation (1.8) can be derived from the integral equation (1.10) by simple 
manipulations (see Chapter 2) provided thatq and f(q) are sufficiently smooth. This proviso 
is important because in practice many interesting solutions are not smooth, but contain 
discontinuities such as shock waves. A fundamental feature of nonlinear conservation laws 
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is that these discontinuities can easily develop spontaneously even from smooth initial data, 
and so they must be dealt with both mathematically and computationally. 

At a discontinuity in q, the partial differential equation (1 .8) does not hold in the classical 
sense and it is important to remember that the integral conservation law (1.10) is the more 
fundamental equation which does continue to hold. A rich mathematical theory of shock- 
wave solutions to conservation laws has been developed. This theory is introduced starting 
in Chapter 1 1 . 



1.2 Finite Volume Methods 

Discontinuities lead to computational difficulties and the main subject of this book is the 
accurate approximation of such solutions. Classical finite difference methods, in which 
derivatives are approximated by finite differences, can be expected to break down near 
discontinuities in the solution where the differential equation does not hold. This book 
concerns finite volume methods, which are based on the integral form (1.10) instead of 
the differential equation. Rather than pointwise approximations at grid points, we break 
the domain into grid cells and approximate the total integral of q over each grid cell, or 
actually the cell average of q, which is this integral divided by the volume of the cell. These 
values are modified in each time step by the flux through the edges of the grid cells, and the 
primary problem is to determine good numerical flux functions that approximate the correct 
fluxes reasonably well, based on the approximate cell averages, the only data available. We 
will concentrate primarily on one class of high-resolution finite volume methods that have 
proved to be very effective for computing discontinuous solutions. See Section 6.3 for an 
introduction to the properties of these methods. 

Other classes of methods have also been applied to hyperbolic equations, such as finite 
element methods and spectral methods. These are not discussed directly in this book, 
although much of the material presented here is good background for understanding high- 
resolution versions. 



1.2.1 Riemann Problems 

A fundamental tool in the development of finite volume methods is the Riemann problem, 
which is simply the hyperbolic equation together with special initial data. The data is 
piecewise constant with a single jump discontinuity at some point, say x — 0, 



q(x, 0) 



qi if -V < 0, 

q r if.r > 0. 



(1.11) 



If <2,_ i and Qj are the cell averages in two neighboring grid cells on a finite volume grid, then 
by solving the Riemann problem with qi — <2,_i and q, — Qi, we can obtain information 
that can be used to compute a numerical flux and update the cell averages over a time 
step. For hyperbolic problems the solution to the Riemann problem is typically a similarity 
solution, a function of x/t alone, and consists of a finite set of waves that propagate away 
from the origin with constant wave speeds. For linear hyperbolic systems the Riemann 
problem is easily solved in terms of the eigenvalues and eigenvectors of the matrix A, as 
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developed in Chapter 3. This simple structure also holds for nonlinear systems of equations 
and the exact solution (or arbitrarily good approximations) to the Riemann problem can be 
constructed even for nonlinear systems such as the Euler equations. The theory of nonlinear 
Riemann solutions for scalar problems is developed in Chapter 1 1 and extended to systems 
in Chapter 13. 

Computationally, the exact Riemann solution is often too expensive to compute for non- 
linear problems and approximate Riemann solvers are used in implementing numerical 
methods. These techniques are developed in Section 15.3. 



1.2.2 Shock Capturing vs. Tracking 

Since the PDEs continue to hold away from discontinuities, one possible approach is to 
combine a standard finite difference or finite volume method in smooth regions with some 
explicit procedure for tracking the location of discontinuities. This is the numerical analogue 
of the mathematical approach in which the PDEs are supplemented by jump conditions 
across discontinuities. This approach is often called shock tracking or front tracking. In more 
than one space dimension, discontinuities typically lie along curves (in two dimensions) or 
surfaces (in three dimensions), and such algorithms typically become quite complicated. 
Moreover, in realistic problems there may be many such surfaces that interact in complicated 
ways as time evolves. This approach will not be discussed further in this book. For some 
examples and discussion, see [41], [66], [103], [153], [154], [171], [207], [289], [290], 
[321], [322], [371], [372], 

Instead we concentrate here on shock- capturing methods, where the goal is to capture 
discontinuities in the solution automatically, without explicitly tracking them. Discontinu- 
ities must then be smeared over one or more grid cells. Success requires that the method 
implicitly incorporate the correct jump conditions, reduce smearing to a minimum, and not 
introduce nonphysical oscillations near the discontinuities. High-resolution finite volume 
methods based on Riemann solutions often perform well and are much simpler to implement 
than shock-tracking methods. 



1.3 Multidimensional Problems 

The Riemann problem is inherently one-dimensional, but is extensively used also in the 
solution of multidimensional hyperbolic problems. A two-dimensional finite volume grid 
typically consists of polygonal grid cells; quadrilaterals or triangles are most commonly 
used. A Riemann problem normal to each edge of the cell can be solved in order to determine 
the flux across that edge. In three dimensions each face of a finite volume cell can be 
approximated by a plane, and a Riemann problem normal to this plane solved in order 
to compute the flux. Multidimensional problems are discussed in the Part III of the book, 
starting with an introduction to the mathematical theory in Chapter 18. 

If the finite volume grid is rectangular, or at least logically rectangular, then the simplest 
way to extend one-dimensional high-resolution methods to more dimensions is to use di- 
mensional splitting, a fractional-step approach in which one-dimensional problems along 
each coordinate direction are solved in turn. This approach, which is often surprisingly ef- 
fective in practice, is discussed in Section 19.5. In some cases a more fully multidimensional 
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method is required, and one approach is developed starting in Chapter 20, which again relies 
heavily on our ability to solve one-dimensional Riemann problems. 



1.4 Linear Waves and Discontinuous Media 

High-resolution methods were originally developed for nonlinear problems in order to ac- 
curately capture discontinuous solutions such as shock waves. Linear hyperbolic equations 
often arise from studying small-amplitude waves, where the physical nonlinearities of the 
true equations can be safely ignored. Such waves are often smooth, since shock waves can 
only appear from nonlinear phenomena. The acoustic waves we are most familiar with arise 
from oscillations of materials at the molecular level and are typically well approximated 
by linear combinations of sinusoidal waves at various frequencies. Similarly, most familiar 
electromagnetic waves, such as visible light, are governed by the linear Maxwell equations 
(another hyperbolic system) and again consist of smooth sinusoidal oscillations. 

For many problems in acoustics or optics the primary computational difficulty arises from 
the fact that the domain of interest is many orders of magnitude larger than the wavelengths 
of interest, and so it is important to use a method that can resolve smooth solutions with a 
very high order of accuracy in order to keep the number of grid points required manageable. 
For problems of this type, the methods developed in this book may not be appropriate. These 
finite volume high-resolution methods are typically at best second-order accurate, resulting 
in the need for many points per wavelength for good accuracy. Moreover they have a 
high cost per grid cell relative to simpler finite difference methods, because of the need to 
solve Riemann problems for each pair of grid cells every time step. The combination can 
be disastrous if we need to compute over a domain that spans thousands of wavelengths. 
Instead methods with a higher order of accuracy are typically used, e.g., fourth-order finite 
difference methods or spectral methods. For some problems it is hopeless to try to resolve 
individual wavelengths, and instead ray-tracing methods such as geometrical optics are 
used to determine how rays travel without discretizing the hyperbolic equations directly. 

However, there are some situations in which high-resolution methods based on Riemann 
solutions may have distinct advantages even for linear problems. In many applications wave- 
propagation problems must be solved in materials that are not homogeneous and isotropic. 
The heterogeneity may be smoothly varying (e.g., acoustics in the ocean, where the sound 
speed varies with density, which may vary smoothly with changes in salinity, for example). 
In this case high-order methods may still be applicable. In many cases, however, there are 
sharp interfaces between different materials. If we wish to solve for acoustic or seismic 
waves in the earth, for example, the material parameters typically have jump discontinuities 
where soil meets rock or at the boundaries between different types of rock. Ultrasound waves 
in the human body also pass through many interfaces, between different organs or tissue and 
bone. Even in ocean acoustics there may be distinct layers of water with different salinity, 
and hence jump discontinuities in the sound speed, as well as the interface at the ocean floor 
where waves pass between water and earth. With wave-tracing methods it may be possible 
to use reflection and transmission coefficients and Snell’s law to trace rays and reflected 
rays at interfaces, but for problems with many interfaces this can be unwieldy. If we wish 
to model the wave motion directly by solving the hyperbolic equations, many high-order 
methods can have difficulties near interfaces, where the solution is typically not smooth. 
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For these problems, high-resolution finite volume methods based on solving Riemann 
problems can be an attractive alternative. Finite volume methods are a natural choice for 
heterogeneous media, since each grid cell can be assigned different material properties via 
an appropriate averaging of the material parameters over the volume enclosed by the cell. 
The idea of a Riemann problem is easily extended to the case where there is a discontinuity 
in the medium at x — 0 as well as a discontinuity in the initial data. Solving the Riemann 
problem at the interface between two cells then gives a decomposition of the data into waves 
moving into each cell, including the effects of reflection and transmission as waves move 
between different materials. Indeed, the classical reflection and transmission coefficients 
for various problems are easily derived and understood in terms of particular Riemann 
solutions. Variable-coefficient linear problems are discussed in Chapter 9 and Section 21.5. 

Hyperbolic equations with variable coefficients may not be in conservation form, and 
so the methods are developed here in a form that applies more generally. These wave- 
propagation methods are based directly on the waves arising from the solution of the 
Riemann problem rather than on numerical fluxes at cell interfaces. When applied to con- 
servation laws, there is a natural connection between these methods and more standard 
flux-differencing methods, which will be elucidated as we go along. But many of the 
shock-capturing ideas that have been developed in the context of conservation laws are 
valuable more broadly, and one of my goals in writing this book is to present these meth- 
ods in a more general framework than is available elsewhere, and with more attention to 
applications where they have not traditionally been applied in the past. 

This book is organized in such a way that all of the ideas required to apply the methods 
on linear problems are introduced first, before discussing the more complicated nonlinear 
theory. Readers whose primary interest is in linear waves should be able to skip the nonlinear 
parts entirely by first studying Chapters 2 through 9 (on linear problems in one dimension) 
and then the preliminary parts of Chapters 18 through 23 (on multidimensional problems). 

For readers whose primary interest is in nonlinear problems, I believe that this orga- 
nization is still sensible, since many of the fundamental ideas (both mathematical and 
algorithmic) arise already with linear problems and are most easily understood in this con- 
text. Additional issues arise in the nonlinear case, but these are most easily understood if 
one already has a firm foundation in the linear theory. 



1.5 clawpack Software 

The CLAWPACK software (“conservation-laws package”) implements the various wave- 
propagation methods discussed in this book (in Fortran). This software was originally 
developed as a teaching tool and is intended to be used in conjunction with this book. 
The use of this software is briefly described in Chapter 5, and additional documentation is 
available online, from the webpage 

http : / /www . amath . Washington . edu/~claw 

Virtually all of the computational examples presented in the book were created using 
CLAWPACK, and the source code used is generally available via the website 

http : / /www . amath . Washington . edu/~ claw/book . html 
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A parenthetical remark in the text or figure captions of the form 

[claw/book/chapN/examplename] 

is an indication that accompanying material is available at 

http : / /www . amath . Washington . edu/~claw/book/ chapN/ examplename/www 

often including an animation of time-dependent solutions. From this webpage it is generally 
possible to download a CLAWPACK directory of the source code for the example. Down- 
loading the tarfile and unpacking it in your claw directory results in a subdirectory called 
claw/book/chapN/examplename. (You must first obtain the basic CLAWPACK routines as 
described in Chapter 5.) 

You are encouraged to use this software actively, both to develop an intuition for the 
behavior of solutions to hyperbolic equations and also to develop direct experience with 
these numerical methods. It should be easy to modify the examples to experiment with 
different parameters or initial conditions, or with the use of different methods on the same 
problem. 

These examples can also serve as templates for developing codes for other problems. 
In addition, many problems not discussed in this book have already been solved using 
CLAWPACK and are often available online. Some pointers can be found on the webpages for 
the book, and others are collected within the CLAWPACK software in the applications 
subdirectory; see 

http : // www . amath . Washington . edu/~claw/ apps . html 



1.6 References 

Some references for particular applications and methods are given in the text. There are 
thousands of papers on these topics, and I have not attempted to give an exhaustive survey 
of the literature by any means. The references cited have been chosen because they are 
particularly relevant to the discussion here or provide a good entrance point to the broader 
literature. Listed below are a few books that may be of general interest in understanding 
this material, again only a small subset of those available. 

An earlier version of this book appeared as a set of lecture notes [281]. This contains 
a different presentation of some of the same material and may still be of interest. My 
contribution to [287] also has some overlap with this book, but is directed specifically 
towards astrophysical flows and also contains some description of hyperbolic problems 
arising in magnetohydrodynamics and relativistic flow, which are not discussed here. 

The basic theory of hyperbolic equations can be found in many texts, for example John 
[229], Kevorkian [234], The basic theory of nonlinear conservation laws is neatly presented 
in the monograph of Lax [263]. Introductions to this material can also be found in many 
other books, such as Liu [311], Whitham [486], or Chorin & Marsden [68]. The book of 
Courant & Friedrichs [92] deals almost entirely with gas dynamics and the Euler equations, 
but includes much of the general theory of conservation laws in this context and is very 
useful. The books by Bressan [46], Dafermos [98], Majda [319], Serre [402], Smoller 
[420], and Zhang & Hsiao [499] present many more details on the mathematical theory of 
nonlinear conservation laws. 
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For general background on numerical methods for PDEs, the books of Iserles [211], 
Morton & Mayers [333], Strikwerda [427], or Tveito & Winther [461] are recommended. 
The book of Gustafsson, Kreiss & Oliger [174] is aimed particularly at hyperbolic problems 
and contains more advanced material on well-posedness and stability of both initial- and 
initial-boundary-value problems. The classic book of Richtmyer & Morton [369] contains a 
good description of many of the mathematical techniques used to study numerical methods, 
particularly for linear equations. It also includes a large section on methods for nonlinear 
applications including fluid dynamics, but is out of date by now and does not discuss many 
of the methods we will study. 

A number of books have appeared recently on numerical methods for conservation laws 
that cover some of the same techniques discussed here, e.g., Godlewski & Raviart [156], 
Kroner [245], and Toro [450]. Several other books on computational fluid dynamics are also 
useful supplements, including Durran [117], Fletcher [137], Hirsch [198], Laney [256], 
Oran & Boris [348], Peyret & Taylor [359], and Tannehill, Anderson & Pletcher [445]. 
These books discuss the fluid dynamics in more detail, generally with emphasis on specific 
applications. 

For an excellent collection of photographs illustrating a wide variety of interesting fluid 
dynamics, including shock waves, Van Dyke’s Album of Fluid Motion [463] is highly re- 
commended. 

Many more references on these topics can easily be found these days by searching on the 
web. In addition to using standard web search engines, there are preprint servers that contain 
collections of preprints on various topics. In the field of conservation laws, the Norwegian 
preprint server at 

http : //www . math . ntnu . no/ conservation/ 

is of particular note. Online citation indices and bibliographic databases are extremely useful 
in searching the literature, and students should be encouraged to learn to use them. Some 
useful links can be found on the webpage [claw/book/chapl/] . 



1.7 Notation 

Some nonstandard notation is used in this book that may require explanation. In general I 
use q to denote the solution to the partial differential equation under study. In the literature 
the symbol u is commonly used, so that a general one-dimensional conservation law has 
the form u t + f(u) x = 0, for example. However, most of the specific problems we will 
study involve a velocity (as in the acoustics equations (1.5)), and it is very convenient 
to use u for this quantity (or as the x -component of the velocity vector Tt — (u, v) in two 
dimensions). 

The symbol Q" (in one dimension) or Q'l (in two dimensions) is used to denote the 
numerical approximation to the solution q. Subscripts on Q denote spatial locations (e.g., 
the ;th grid cell), and superscript n denotes time level t n . Often the temporal index is 
suppressed, since we primarily consider one-step methods where the solution at time t n+ \ is 
determined entirely by data at time t„ . When Q or other numerical quantities lack a temporal 
superscript it is generally clear that the current time level t n is intended. 
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For a system of m equations, q and Q are m -vectors, and superscripts are also used to 
denote the components of these vectors, e.g., q p for p= 1,2, , m. It is more convenient 
to use superscripts than subscripts for this purpose to avoid conflicts with spatial indices. 
Superscripts are also used to enumerate the eigenvalues ). p and eigenvectors r p of an m x m 
matrix. Luckily we generally do not need to refer to specific components of the eigenvectors. 
Of course superscripts must also be used for exponents at times, and this will usually be 
clear from context. Initial data is denoted by a circle above the variable, e.g., q(x), rather 
than by a subscript or superscript, in order to avoid further confusion. 

Several symbols play multiple roles in different contexts, since there are not enough 
letters and familiar symbols to go around. For example, i// is used in different places for the 
entropy flux, for source terms, and for stream functions. For the most part these different 
uses are well separated and should be clear from context, but some care is needed to 
avoid confusion. In particular, the index p is generally used for indexing eigenvalues and 
eigenvectors, as mentioned above, but is also used for the pressure in acoustics and gas 
dynamics applications, often in close proximity. Since the pressure is never a superscript, I 
hope this will be clear. 

One new symbol I have introduced is q\qi, q r ) (pronounced perhaps “q Riemann”) to 
denote the value that arises in the similarity solution to a Riemann problem along the ray 
x/t — 0, when the data q\ and q r is specified (see Section 1.2.1). This value is often used in 
defining numerical fluxes in finite volume methods, and it is convenient to have a general 
symbol for the function that yields it. This symbol is meant to suggest the spreading of 
waves from the Riemann problem, as will be explored starting in Chapter 3. Some notation 
specific to multidimensional problems is introduced in Section 18.1. 




Part one 

Linear Equations 




— 2 

Conservation Laws and Differential Equations 



To see how conservation laws arise from physical principles, we will begin by considering 
the simplest possible fluid dynamics problem, in which a gas or liquid is flowing through a 
one-dimensional pipe with some known velocity u(x , t), which is assumed to vary only with 
x , the distance along the pipe, and time t. Typically in fluid dynamics problems we must 
determine the motion of the fluid, i.e., the velocity function u (x , t), as part of the solution, 
but let’s assume this is already known and we wish to simply model the concentration or 
density of some chemical present in this fluid (in very small quantities that do not affect 
the fluid dynamics). Let q{x, t) be the density of this chemical tracer , the function that we 
wish to determine. 

In general the density should be measured in units of mass per unit volume, e.g., grams 
per cubic meter, but in studying the one-dimensional pipe with variations only in x, it is 
more natural to assume that q is measured in units of mass per unit length, e.g., grams per 
meter. This density (which is what is denoted by q here) can be obtained by multiplying the 
three-dimensional density function by the cross-sectional area of the pipe (which has units 
of square meters). Then 




q(x , t) dx 



( 2 . 1 ) 



represents the total mass of the tracer in the section of pipe between x \ and X 2 at the particular 
time t, and has the units of mass. In problems where chemical kinetics is involved, it is 
often necessary to measure the “mass” in terms of moles rather than grams, and the density 
in moles per meter or moles per cubic meter, since the important consideration is not the 
mass of the chemical but the number of molecules present. For simplicity we will speak in 
terms of mass, but the conservation laws still hold in these other units. 

Now consider a section of the pipe x\ < x < X 2 and the manner in which the integral 
(2.1) changes with time. If we are studying a substance that is neither created nor destroyed 
within this section, then the total mass within this section can change only due to the flux 
or flow of particles through the endpoints of the section at x\ and aa . Let Fflt) be the rate 
at which the tracer flows past the fixed point x, for ( = 1,2 (measured in grams per second, 
say). We use the convention that Fflt) > 0 corresponds to flow to the right, while Fflt) < 0 
means a leftward flux, of | F, (f) grams per second. Since the total mass in the section [xi , xfl] 
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changes only due to fluxes at the endpoints, we have 



d 

dt 




q(x, t)dx — F\(t) — F 2 (t). 



(2.2) 



Note that +F\ (t) and — F 2 (t) both represent fluxes into this section. 

The equation (2.2) is the basic integral form of a conservation law, and equations of this 
type form the basis for much of what we will study. The rate of change of the total mass 
is due only to fluxes through the endpoints - this is the basis of conservation. To proceed 
further, we need to determine how the flux functions Eft) are related to q(x, t), so that we 
can obtain an equation that might be solvable for q. In the case of fluid flow as described 
above, the flux at any point x at time t is simply given by the product of the density q(x, t) 
and the velocity u(x. t): 



flux at (x, t) — u(x, t)q(x, t). (2.3) 

The velocity tells how rapidly particles are moving past the point x (in meters per second, 
say), and the density q tells how many grams of chemical a meter of fluid contains, so the 
product, measured in grams per second, is indeed the rate at which chemical is passing this 
point. 

Since u(x,t) is a known function, we can write this flux function as 

flux = f (q, x, t ) = n(x, t)q. (2.4) 

In particular, if the velocity is independent of x and t, so u(x, t) = u is some constant, then 
we can write 



flux = f{q) — uq. 



(2.5) 



In this case the flux at any point and time can be determined directly from the value of 
the conserved quantity at that point, and does not depend at all on the location of the point 
in space-time. In this case the equation is called autonomous. Autonomous equations will 
occupy much of our attention because they arise in many applications and are simpler to 
deal with than nonautonomous or variable-coefficient equations, though the latter will also 
be studied. 

For a general autonomous flux f(q) that depends only on the value of q , we can rewrite 
the conservation law (2.2) as 



d 

dt 




q(x, t)dx = f(q(x u t)) - f(q(x 2 , t)). 



(2.6) 



The right-hand side of this equation can be rewritten using standard notation from calculus: 



d 

dt 




q(x, t)dx — —f(q(x, t)) 



X2 



*1 



(2.7) 



This shorthand will be useful in cases where the flux has a complicated form, and also 
suggests the manipulations performed below, leading to the differential equation for q. 
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Once the flux function f(q) is specified, e.g., by (2.5) for the simplest case considered 
above, we have an equation for q that we might hope to solve. This equation should hold 
over every interval [xi , X 2 ] for arbitrary values of x\ and xi. It is not clear how to go about 
finding a function q(x.t) that satisfies such a condition. Instead of attacking this problem 
directly, we generally transform it into a partial differential equation that can be handled 
by standard techniques. To do so, we must assume that the functions q(x, t) and f(q ) are 
sufficiently smooth that the manipulations below are valid. This is very important to keep 
in mind when we begin to discuss nonsmooth solutions to these equations. 

If we assume that q and f are smooth functions, then this equation can be rewritten as 



d 

dt 



r Xi r Xi 9 

/ q(x, t)dx — — / — f(q(x,t))dx, 

Jx, Jx, d* 



or, with some further modification, as 

9 



/ 

J X\ 



q(x, t) + —f(q(x, t )) 
dt dx 



dx — 0. 



(2.8) 



(2.9) 



Since this integral must be zero for all values of x\ and xi, it follows that the integrand must 
be identically zero. This gives, finally, the differential equation 



9 

9r 



q(x, t) + 



9 

dx 



f (q (x , t)) = 0 . 



(2.10) 



This is called the differential form of the conservation laws. Partial differential equations 
(PDEs) of this type will be our main focus. Partial derivatives will usually be denoted by 
subscripts, so this will be written as 



q,(x. t) + f (q(x , t)) x = 0. 



(2.11) 



2.1 The Advection Equation 

For the flux function (2.5), the conservation law (2.10) becomes 



q, + uq x = 0. (2.12) 

This is called the advection equation , since it models the advection of a tracer along with 
the fluid. By a tracer we mean a substance that is present in very small concentrations 
within the fluid, so that the magnitude of the concentration has essentially no effect on the 
fluid dynamics. For this one-dimensional problem the concentration (or density) q can be 
measured in units such as grams per meter along the length of the pipe, so that f Xl q(x, t) dx 
measures the total mass (in grams) within this section of pipe. In Section 9.1 we will 
consider more carefully the manner in which this is measured and the form of the resulting 
advection equation in more complicated cases where the diameter of the pipe and the fluid 
velocity need not be constant. 

Equation (2.12) is a scalar, linear, constant-coefficient PDE of hyperbolic type. The 
general solution of this equation is very easy to determine. Any smooth function of the 
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form 



q(x, t) — q(x — ut) (2.13) 

satisfies the differential equation (2.12), as is easily verified, and in fact any solution to 

(2.12) is of this form for some q. Note that q(x, t) is constant along any ray in space-time 
for which x — iit — constant. For example, all along the ray X(t) = xq + ut the value of 
q(X(t), t) is equal to q(x o). Values of q simply advect (i.e., translate) with constant velocity 
m, as we would expect physically, since the fluid in the pipe (and hence the density of 
tracer moving with the fluid) is simply advecting with constant speed. These rays X(t ) are 
called the characteristics of the equation. More generally, characteristic curves for a PDE 
are curves along which the equation simplifies in some particular manner. For the equation 

(2.12) , we see that along X(t) the time derivative of q(X(t), t) is 

~ 7 ~q{X{t), t) = q t (X(t), t) + X'(t)q x (X(t), t) 
dt 

= q, + uq x 

= 0. (2.14) 

and the equation (2.12) reduces to a trivial ordinary differential equation jjQ — 0, where 
<2(0 = q(X(t), t). This again leads to the conclusion that q is constant along the charac- 
teristic. 

To find the particular solution to (2.12) of interest in a practical problem, we need more 
information in order to determine the particular function q in (2.13): initial conditions and 
perhaps boundary conditions for the equation. First consider the case of an infinitely long 
pipe with no boundaries, so that (2. 12) holds for — oo < a' < oo. Then to determine q(x, t) 
uniquely for all times t > to we need to know the initial condition at time to, i.e., the initial 
density distribution at this particular time. Suppose we know 

q(x, t 0 ) — q(x), (2.15) 

where q(x) is a given function. Then since the value of q must be constant on each charac- 
teristic, we can conclude that 



q(x, t) — q(x — u(t — to)) 

for t > to- The initial profile q simply translates with speed u. 

If the pipe has finite length, a < x < b, then we must also specify the density of tracer 
entering the pipe as a function of time, at the inflow end. For example, if u > 0 then we 
must specify a boundary condition at x = a, say 

q(a, t) = g 0 (t) for t > to 

in addition to the initial condition 



q(x, t) = q(x) for a < x < b. 
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Fig. 2. 1 . The solution to the advection equation is constant along characteristics. When solving this 
equation on the interval [a, b\, we need boundary conditions at x = a if u > 0 as shown in (a), or at 
x = h if u <0 as shown in (b). 



The solution is then 



q(x , t ) 



go(t — ( x — a)/u) if a < x < a + u(t — to), 
q(x — u(t — to)) if a + u{t — to) < x < b. 



Note that we do not need to specify a boundary condition at the outflow boundary x = b 
(and in fact cannot, since the density there is entirely determined by the data given already). 

If on the other hand u < 0, then flow is to the left and we would need a boundary 
condition at x — b rather than at x — a. Figure 2.1 indicates the flow of information along 
characteristics for the two different cases. The proper specification of boundary conditions 
is always an important part of the setup of a problem. 

From now on, we will generally take the initial time to be t = 0 to simplify notation, but 
everything extends easily to general t 0 - 



2.1.1 Variable Coefficients 

If the fluid velocity u varies with x, then the flux (2.4) leads to the conservation law 



q, + (u(x)q) x = 0. 



(2.16) 



In this case the characteristic curves X(t) are solutions to the ordinary differential equations 

X\t) = u{X(t)). (2.17) 

Starting from an arbitrary initial point xq, we can solve the equation (2.17) with initial 
condition X ( 0 ) = xq to obtain a particular characteristic curve X(t). Note that these curves 
track the motion of particular material particles carried along by the fluid, since their velocity 
at any time matches the fluid velocity. Along a characteristic curve we find that the advection 
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q,{X(t),t) + X'{t)q x {X{t),t) 

q, + u(X(t))q x 
q, + ( u(X(t))q) x - u'(X(t))q 

-u'(X(t))q(X(t),t). (2.18) 

Note that when u is not constant, the curves are no longer straight lines and the solution q 
is no longer constant along the curves, but still the original partial differential equation has 
been reduced to solving sets of ordinary differential equations. 

The operator 3, + ud x is often called the material derivative , since it represents differen- 
tiation along the characteristic curve, and hence computes the rate of change observed by a 
material particle moving with the fluid. 

The equation (2. 16) is an advection equation in conservation form. In some applications 
it is more natural to derive a nonconservative advection equation of the form 

q, + u(x)q x = 0. (2.19) 

Again the characteristic curves satisfy (2.17) and track the motion of material points. For 
this equation the second line of the right-hand side of (2.18) reduces to zero, so that q is 
now constant along characteristic curves. Which form (2.16) or (2.19) arises often depends 
simply on what units are used to measure physical quantities, e.g., whether we measure 
concentration in grams per meter as was assumed above (giving (2.16)), or whether we use 
grams per cubic meter, as might seem to be a more reasonable definition of concentration 
in a physical fluid. The latter choice leads to (2.19), as is discussed in detail in Chapter 9, 
and further treatment of variable-coefficient problems is deferred until that point. 

2.2 Diffusion and the Advection-Diffusion Equation 

Now suppose that the fluid in the pipe is not flowing, and has zero velocity. Then according 
to the advection equation, q, = 0 and the initial profile q(x) does not change with time. 
However, if q is not constant in space, then in fact it should still tend to slowly change 
due to molecular diffusion. The velocity u should really be thought of as a mean velocity , 
the average velocity that the roughly 10 23 molecules in a given drop of water have. But 
individual molecules are bouncing around in different directions, and so molecules of the 
substance we are tracking will tend to get spread around in the water, as a drop of ink spreads. 
There will tend to be a net motion from regions where the density is large to regions where 
it is smaller. Fick’s law of diffusion states that the net flux is proportional to the gradient 
of q, which in one space dimension is simply the derivative q x . The flux at a point x now 
depends on the value of q x at this point, rather than on the value of q, so we write 

flux of q = f(q x ) = -fiq x , (2.20) 

where f is the diffusion coefficient. Using this flux in (2.10) gives 

q, = fqxx, ( 2 . 21 ) 

which is known as the diffusion equation. 



equation (2.16) simplifies: 
d 



dt 



q(X(t), t) = 
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In some problems the diffusion coefficient may vary with x. Then / = —/3(x)q x and the 
equation becomes 



q, = (0 (x)q x ) x . (2.22) 

Returning to the example of fluid flow, more generally there would be both advection 
and diffusion occurring simultaneously. Then the flux is f(q, q x ) — uq — fq x , giving the 
advection— diffusion equation 



q t + uq x = f>q xx . (2.23) 

The diffusion and advection-diffusion equations are examples of the general class of 
PDEs called parabolic. 



2.3 The Heat Equation 

The equation (2.21) (or more generally (2.22)) is often called the heat equation , for heat 
diffuses in much the same way as a chemical concentration. In the case of heat, there may 
be no net motion of the material, but thermal vibration of molecules causes neighboring 
molecules to vibrate and this internal energy diffuses through the material. Let q(x,t) now 
be the temperature of the material at point x (e.g., a metal rod, since we are in one space 
dimension). The density of internal energy at point x is then given by 

E(x, t) = K(x)q{x, t ), 

where k(x) is the heat capacity of the material at this point. It is this energy that is conserved, 
and hence varies in a test section [x \ , X 2 ] only due to the flux of energy past the endpoints. 
The heat flux is given by Fourier’s law of heat conduction. 



flux = —f>q x . 



where f is the coefficient of thermal conductivity. This looks identical to Fick’s law for 
diffusion, but note that Fourier’s law says that the energy flux is proportional to the temper- 
ature gradient. If the heat capacity is identically constant, say k = 1 , then this is identical 
to Fick’s law, but there is a fundamental difference if k varies. Equation (2.22) is the heat 
equation when k = 1 . More generally the heat equation is derived from the conservation law 



d 

dt 




K(x)q(x, t)dx — —f(x)q x (x, t) 



X2 



*1 



(2.24) 



and has the differential form 



(Kq)t = (fq.x)x- 

Typically k does not vary with time and so this can be written as 



(2.25) 



Kq t — (Pq x ) x - 



(2.26) 
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2.4 Capacity Functions 

In the previous section we saw how the heat capacity comes into the conservation law for 
heat conduction. There are also other situations where a “capacity” function naturally arises 
in the derivation of a conservation law, where again the flux of a quantity is naturally defined 
in terms of one variable q , whereas it is a different quantity Kq that is conserved. If the flux 
function is f(q), then the obvious generalization of (2.24) yields the conservation law 

«q t + f(q)x = 0. (2.27) 

While it may be possible to incorporate k into the definition of fiq), it is often preferable 
numerically to work directly with the form (2.27). This is discussed in Section 6.16 and is 
useful in many applications. In fluid flow problems, k might represent the capacity of the 
medium to hold fluid. For flow through a pipe with a varying diameter, k{x) might be the 
cross-sectional area, for example (see Section 9.1). For flow in porous media, k would be 
the porosity, the fraction of the medium available to fluid. On a nonuniform grid a capacity 
k appears in the numerical method that is related to the size of a physical grid cell; see 
Section 6.17 and Chapter 23. 



2.5 Source Terms 

In some situations / ' 2 q (x , t) dx changes due to effects other than flux through the endpoints 
of the section, if there is some source or sink of the substance within the section. Denote 
the density function for such a source by f/(q, x, t). (Negative values of i jr correspond to a 
sink rather than a source.) Then the equation becomes 

d f X2 f X2 8 [ Xl 

— / q(x,t)dx— / — f(q(x,t))dx + I i/r(q(x,t),x,t)dx. 

dt J X1 J x , dx J XI 

This leads to the PDE 

q t(x, t ) + f(q(x, t)) x = i Hq(x, t ), x, t ). (2.28) 

In this section we mention only a few effects that lead to source terms. Conservation laws 
with source terms are more fully discussed in Chapter 17. 



2.5.1 External Heat Sources 

As one example, consider heat conduction in a rod as in Section 2.3, with k = 1 and ft = 
constant, but now suppose there is also an external energy source distributed along the rod 
with density i Jr. Then we obtain the equation 

q,(x, t) = ftq xx (x, t) + \ft(x, t). 

This assumes the heat source is independent of the current temperature. In some cases 
the strength of the source may depend on the value of q. For example, if the rod is immersed 
in a liquid that is held at constant temperature qo, then the flux of heat into the rod at the 
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point (x, t) is proportional to qo — q(x, t) and the equation becomes 
q r (x, t ) = Pq xx (x, t ) + D(qo - q(x, /)), 
where D is the conductivity coefficient between the rod and the bath. 



2.5.2 Reacting Flow 

As another example, consider a fluid flowing through a pipe at constant velocity as in 
Section 2.1, but now suppose there are several different chemical species being advected 
in this flow (in minute quantities compared to the bulk fluid). If these chemicals react with 
one another, then the mass of each species individually will not be conserved, since it is 
used up or produced by the chemical reactions. We will have an advection equation for each 
species, but these will include source terms arising from the chemical kinetics. 

As an extremely simple example, consider the advection of a radioactive isotope with 
concentration measured by q 1 , which decays spontaneously at some rate a into a different 
isotope with concentration q 1 . If this decay is taking place in a fluid moving with velocity 
m, then we have a system of two advection equations with source terms: 



qj + uq\ = —aq l , 
q; + uql = +aq 1 . 



(2.29) 



This has the form q, + Aq x — \j/(q ). in which the coefficient matrix A is diagonal with both 
diagonal elements equal to u . This is a hyperbolic system, with a source term. More generally 
we might have m species with various chemical reactions occurring simultaneously between 
them. Then we would have a system of m advection equations (with diagonal coefficient 
matrix A = iil) and source terms given by the standard kinetics equations of mass action. 

If there are spatial variations in concentrations, then these equations may be augmented 
with diffusion terms for each species. This would lead to a system of reaction-advection- 
diffusion equations of the form 



q, + Aq x = Pq xx + i Jr(q). (2.30) 

The diffusion coefficient could be different for each species, in which case /I would be a 
diagonal matrix instead of a scalar. 

Other types of source terms arise from external forces such as gravity or from geometric 
transformations used to simplify the equations. See Chapter 17 for some other examples. 



2.6 Nonlinear Equations in Fluid Dynamics 

In the pipe-flow model discussed above, the function q (x, t ) represented the density of some 
tracer that was carried along with the fluid but was present in such small quantities that 
the distribution of q has no effect on the fluid velocity. Now let’s consider the density of 
the fluid itself, again in grams per meter, say, for this one-dimensional problem. We will 
denote the fluid density by the standard symbol p(x, t). If the fluid is incompressible (as 
most liquids can be assumed to be for most purposes), then p(x, t) is constant and this 
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one-dimensional problem is not very interesting. If we consider a gas, however, then the 
molecules are far enough apart that compression or expansion is possible and the density 
may vary from point to point. 

If we again assume that the velocity u is constant, then the density p will satisfy the same 
advection equation as before (since the flux is simply up and m is constant), 



p, + up x = 0, (2.31) 

and any initial variation in density will simply translate at speed u. However, this is not 
what we would expect to happen physically. If the gas is compressed in some region (i.e., 
the density is higher here than nearby) then we would expect that the gas would tend to 
push into the neighboring gas, spreading out, and lowering the density in this region while 
raising the density nearby. (This does in fact happen provided that the pressure is also higher 
in this region; see below.) In order for the gas to spread out it must move relative to the 
neighboring gas, and hence we expect the velocity to change as a result of the variation in 
density. 

While previously we assumed the tracer density q had no effect on the velocity, this 
is no longer the case. Instead we must view the velocity uix , t) as another unknown to 
be determined along with p(x , t). The density flux still takes the form (2.3), and so the 
conservation law for p has the form 



Pt + (pu)x = 0, (2.32) 

which agrees with (2.31) only if u is constant. This equation is generally called the continuity 
equation in fluid dynamics, and models the conservation of mass. 

In addition to this equation we now need a second equation for the velocity. The velocity 
itself is not a conserved quantity, but the momentum is. The product p(x, t)u(x , t) gives 
the density of momentum, in the sense that the integral of pu between any two points x\ 
and a '2 yields the total momentum in this interval, and this can change only due to the flux 
of momentum through the endpoints of the interval. The momentum flux past any point x 
consists of two parts. First there is momentum carried past this point along with the moving 
fluid. For any density function q this flux has the form qu, as we have already seen at the 
beginning of this chapter, and so for the momentum q — pu this contribution to the flux is 
(pu)u — pu 2 . This is essentially an advective flux, although in the case where the quantity 
being advected is the velocity or momentum of the fluid itself, the phenomenon is often 
referred to as convection rather than advection. 

In addition to this macroscopic convective flux, there is also a microscopic momentum 
flux due to the pressure of the fluid, as described in Section 14.1. This enters into the 
momentum flux, which now becomes 

momentum flux = pu 2 + p. 



The integral form of the conservation law (2.7) is then 



d 

dt 



f 

J X\ 



p(x, t)u(x, t)dx — —[pu 2 + p] x f. 



(2.33) 
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Note that it is only a difference in pressure between the two ends of the interval that will 
cause a change in the net momentum, as we would expect. We can think of this pressure 
difference as a net force that causes an acceleration of the fluid, though this isn’t strictly 
correct and a better interpretation is given in Section 14.1. 

If we assume that p, u, and p are all smooth, then we obtain the differential equation 

(pu), + ( pu 2 + p) x = 0, (2.34) 

modeling conseiyation of momentum. Combining this with the continuity equation (2.32), 
we have a system of two conservation laws for the conservation of mass and momentum. 
These are coupled equations, since p and pu appear in both. They are also clearly nonlinear, 
since products of the unknowns appear. 

In developing the conservation law for pu we have introduced a new unknown, the 
pressure p(x, t). It appears that we need a third differential equation for this. Pressure is not 
a conserved quantity, however, and so instead we introduce a fourth variable, the energy, 
and an additional equation for the conservation of energy. The density of energy will be 
denoted by E(x, t). This still does not determine the pressure, and to close the system we 
must add an equation of state, an algebraic equation that determines the pressure at any 
point in terms of the mass, momentum, and energy at the point. The energy equation and 
equations of state will be discussed in detail in Chapter 14, where we will derive the full 
system of three conservation laws. 

For the time being we consider special types of flow where we can drop the conservation- 
of-energy equation and use a simpler equation of state that determines p from p alone. For 
example, if no shock waves are present, then it is often correct to assume that the entropy of 
the gas is constant. Such a flow is called isentropic. This is discussed further in Chapter 14. 
This assumption is reasonable in particular if we wish to derive the equations of linear 
acoustics, which we will do in the next section. In this case we look at very small-amplitude 
motions (sound waves) and the flow remains isentropic. In the isentropic case the equation 
of state is simply 



P = i<p y = P{p), 

where k and y are two constants (with yf&lA for air). 

More generally we could assume an equation of state of the form 



(2.35) 



P = Pip), (2.36) 

where P(p) is a given function specifying the pressure in terms of density. To be physically 
realistic we can generally assume that 

P'(p) > 0 for p > 0. (2.37) 

This matches our intuition (already used above) that increasing the density of the gas will 
cause a corresponding increase in pressure. Note that the isentropic equation of state (2.35) 
has this property. We will see below that the assumption (2.37) is necessary in order to 
obtain a hyperbolic system. 
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Using the equation of state (2.36) in (2.34), together with the continuity equation (2.32), 
gives a closed system of two equations: 



Pi + ( pu) x = 0, 
(pu), + (. pu 2 + P(p)) x = 0. 



(2.38) 



This is a coupled system of two nonlinear conservation laws, which we can write in the 
form 



q, + f(q) X = o 



(2.39) 



if we define 





P 




[VI 




pu 




q 2 


q = 


pu 


— 




- f(q) = 


pu 2 + P(p) 


— 


{q 2 ) 2 /q X + P{q l ) 



More generally, a system of m conservation laws takes the form (2.39) with q e M m and 
/ : R"' — > R'”. The components of / are the fluxes of the respective components of q, and 
in general each flux may depend on the values of any or all of the conserved quantities at 
that point. 

Again it should be stressed that this differential form of the conservation law is derived 
under the assumption that q is smooth, from the more fundamental integral form. Note that 
when q is smooth, we can also rewrite (2.39) as 

q, + f'(q)q, = 0, (2.41) 

where f(q ) is the Jacobian matrix with (;, j) entry given by df/dqj. The form (2.41) is 
called the quasilinear form of the equation, because it resembles the linear system 

q, + Aq x = 0, (2.42) 

where A is a given m x m matrix. In the linear case this matrix does not depend on q , while 
in the quasilinear equation (2.41) it does. A thorough understanding of linear systems of the 
form (2.42) is required before tackling nonlinear systems, and the first 10 chapters concern 
only linear problems. There is a close connection between these theories, and the Jacobian 
matrix f r (q) plays an important role in the nonlinear theory. 



2.7 Linear Acoustics 

In general one can always obtain a linear system from a nonlinear problem by linearizing 
about some state. This amounts to defining A = f'(qo) for some fixed state go in the linear 
system (2.42), and gives a mathematically simpler problem that is useful in some situations, 
particularly when the interest is in studying small perturbations about some constant state. 

To see how this comes about, suppose we wish to model the propagation of sound waves 
in a one-dimensional tube of gas. An acoustic wave is a very small pressure disturbance that 
propagates through the compressible gas, causing infinitesimal changes in the density and 
pressure of the gas via small motions of the gas with infinitesimal values of the velocity u. 
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Our eardrums are extremely sensitive to small changes in pressure and translate small 
oscillations in the pressure into nerve impulses that we interpret as sound. Consequently, 
most sound waves are essentially linear phenomena: the magnitudes of disturbances from 
the background state are so small that products or powers of the perturbation amplitude 
can be ignored. As linear phenomena, they also do not involve shock waves, and so a 
linearization of the isentropic equations introduced above is suitable. (An exception is the 
“sonic boom” caused by supersonic aircraft - this is a nonlinear shock wave, or at least 
originates as such.) 

To perform the linearization of (2.40), let 

q(x, t) = qo + q(x, t), (2.43) 

where qo = (po> PoMo) is the background state we are linearizing about and q is the pertur- 
bation we wish to determine. Typically uq = 0, but it can be nonzero if we wish to study 
the propagation of sound in a constant-strength wind, for example. Using (2.43) in (2.1 1) 
and discarding any terms that involve powers or products of the q variables, we obtain the 
linearized equations 

q, + f(qo)q, = 0 (2.44) 



This is a constant-coefficient linear system modeling the evolution of small disturbances. 

To obtain the acoustics equations, we compute the Jacobian matrix for the simplified 
system of gas dynamics (2.38). Differentiating the flux function from (2.40) gives 



9/W d.f l /dq 2 ' 
df 2 /dq l df 2 /dq 2 _ 

0 1 

-(q 2 ) 2 /(q X ) 2 +P'(q X ) 2 q 2 /q x 

0 1 ’ 

— M 2 + P'(fi) 2u 



(2.45) 



The equations of linear acoustics thus take the form of a constant-coefficient linear system 
(2.44) with 



A = f(q 0 ) = 



0 

-Kq + p '(Po) 



1 

2mq 



(2.46) 



Note that the vector q in the system (2.44) has components p and pu, the perturbation of 
density and momentum. When written out in terms of its components, the system is 



Pt + (pu) x = 0 
(pu), +(— + P'(Po))Px + 2mq (pu) x = 0. 



(2.47) 



Physically it is often more natural to model perturbations u and p in velocity and pressure, 
since these can often be measured directly. To obtain such equations, first note that pressure 
perturbations can be related to density perturbations through the equation of state, 



Po + P — P(po + P) — P(Po) + P'(Po)P + • • • > 
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and since po = P(po), we obtain 



P ^ P'(Po)P- 



Also we have 



and so 



pu = ( po + p)(u o + u ) — pouo + pu o + Pqu + pu, 



pu uop + Pqu. 

Using these expressions in the equations (2.47) and performing some manipulations 
(Exercise 2.1) leads to the alternative form of the linear acoustics equations 



Pt + UQp x + K 0 u x = 0, 
Pqu, + p x + PquqUx = 0, 

where 



(2.48) 



Ko — poP'(po). 



(2.49) 



The equations (2.48) can be written as a linear system 



" p~ 


4- 


Uq 


K 0 ' 


" P' 


u 


i 

t 


_ 1/Po 


Uq _ 


u 



Here and from now on we will generally drop the tilde on p and u and use 



q(x, t) = 



p(x, t ) 
u(x, t ) 



(2.50) 



to denote the pressure and velocity perturbations in acoustics. 

The system (2.50) can also be derived by first rewriting the conservation laws (2.38) as 
a nonconservative set of equations for u and p, which is valid only for smooth solutions, 
and then linearizing this system; see Exercise 2.2. 

An important special case of these equations is obtained by setting uq = 0, so that we 
are linearizing about the motionless state. In this case the coefficient matrix A appearing in 
the system (2.50) is 



and the equations reduce to 




K 0 

0 



Pt + K 0 u x — 0, 
Poll, + p x = 0. 



(2.51) 



(2.52) 



In Section 2.12 we will see that essentially the same set of equations can be derived for 
one-dimensional acoustics in an elastic solid. The parameter Kq is called the bulk modulus 
of compressibility of the material; see Section 22.1.2 for more about this parameter. 
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2.8 Sound Waves 

If we solve the equations just obtained for linear acoustics in a stationary gas, we expect the 
solution to consist of sound waves propagating to the left and right. Since the equations are 
linear, we should expect that the general solution consists of a linear superposition of waves 
moving in each direction, and that each wave propagates at constant speed (the speed of 
sound) with its shape unchanged. This suggests looking for solutions to the system (2.52) 
of the form 



q(x, t) — q(x — st) 

for some speed s, where c/(£ ) is some function of one variable. With this Ansatz we compute 
that 



q t (x, t) = —sq'(x — st), q x {x, t) = q'{x — st), 
and so the equation q t + Aq x — 0 reduces to 



Aq\x — st) = sq\x — st). (2.53) 

Since s is a scalar while A is a matrix, this is only possible if ,v is an eigenvalue of the matrix 
A, and q'{f ) must also be a corresponding eigenvector of A for each value of f . Make sure 
you understand why this is so, as this is a key concept in understanding the structure of 
hyperbolic systems. 

For the matrix A in (2.5 1 ) we easily compute that the eigenvalues are 

A 1 = —Co and A 2 = +cq, (2.54) 



where 



co = s/ Kq/pq, (2.55) 

which must be the speed of sound in the gas. As expected, waves can propagate in either 
direction with this speed. Recalling (2.49), we see that 

co = v 7 P'(Po)- (2.56) 

The intuitively obvious assumption (2.37) (that pressure increases with density so that 
P'(p) > 0) turns out to be important mathematically in order for the speed of sound co to 
be a real number. 

For the more general coefficient matrix A of (2.50) with uq f- 0, the eigenvalues are 
found to be 



A 1 = uq — co and A 2 = u 0 + Co- (2.57) 

When the fluid is moving with velocity uq, sound waves still propagate at speed ±co relative 
to the fluid, and at velocities A 1 and A 2 relative to a fixed observer. (See Figure 3.7.) 
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Regardless of the value of mq, the eigenvectors of the coefficient matrix are 



— poCo 


,2 _ 


Pqcq 


1 


, / 


1 



(2.58) 



Any scalar multiple of each vector would still be an eigenvector. We choose the particular 
normalization of (2.58) because the quantity 



Zq = Pqcq (2.59) 

is an important parameter in acoustics, called the impedance of the medium. 

A sound wave propagating to the left with velocity —cq must have the general form 

q(x, t) — w l (x + CQt)r x (2.60) 

for some scalar function rZ> 1 (§■), so that 

q(x , t) — u> l (x + Coty 1 = q(x + cot ) 

and hence q'{£ ) is a scalar multiple of r 1 as required by (2.53) for s = — c o. In terms of the 
components of q this means that 

p(x, t) = — Znw l (x + Cot), 

. (2.61) 

u(x, t ) = w (x + Cot). 

We see that in a left-going sound wave the pressure and velocity perturbations are always 
related by p = —Zqu. Analogously, in a right-going sound wave p = +Zqu everywhere 
and q(x , f) = w 2 (x — cot)r 2 for some scalar function w 2 ( §). (See Figure 3.1.) 

The general solution to the acoustic equations consists of a superposition of left-going 
and right-going waves, and has 

q(x , t) = tfi'Cr + cot)r l + U) 2 (x — cot)r 2 (2.62) 

for some scalar functions w ] (f ) and w 2 (f). Exactly what these functions are will depend 
on the initial data given for the problem. Let 

q{x, 0) = q{x) — 

be the pressure and velocity perturbation at time t — 0. To compute the resulting solution 
q(x, t) we need to determine the scalar functions w 1 and vu 2 in (2.62). To do so we can 
evaluate (2.62) at time t — 0 and set this equal to the given data q, obtaining 

w l (x)r l + u> 2 {x)r 2 — q(x). 

At each point x this gives a 2 x 2 linear system of equations to solve for w l {x) and u> 2 (x) 
at this particular point (since the vectors r 1 , r 2 , and cj(x) are all known). Let 



p(x) 

u(x) 



R — [ r ‘k 2 ] 



(2.63) 
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be the 2x2 matrix with columns r 1 and r 2 . Then this system of equations can be written as 



Rw{x) — q(x), 



(2.64) 



where w(x) is the vector with components w (x ) and uf(x). For acoustics the matrix R is 



R = 



— Zo Zo 
1 1 



(2.65) 



which is a nonsingular matrix provided Zo > 0 as it will be in practice. The solution to 
(2.64) can be found in terms of the inverse matrix 



R~ l = 



1 

2Zn 



-1 Z 0 
1 Z 0 



(2.66) 



We find that 



w\x)= 2-[-p(x) + Zqu(x)], 
ZZq 

W 2 (x ) = -^r[p(x) + Z 0 u(x)]. 
ZZo 



(2.67) 



The solution (2.62) then becomes 



1 Zo 

p(x, t) = -[ p(x + cot ) + p(x - c 0 f)] — — [u(x + cot ) - u(x - c 0 r)], 

ll(x, t ) = ———[p(x + Cot) - p(x - Co?)] + ~[u(x + Cot) + u(x - Cot)]. 
2 Zq 2 



(2.68) 



2.9 Hyperbolicity of Linear Systems 

The process we have just gone through to solve the acoustics equations motivates the 
definition of a first-order hyperbolic system of partial differential equations. This process 
generalizes to solve any linear constant-coefficient hyperbolic system. 

Definition 2.1. A linear system of the form 

q, + Aq x = 0 (2.69) 

is called hyperbolic if the m x m matrix A is diagonalizable with real eigenvalues. 

We denote the eigenvalues by 



< X 2 <■■■ < X m . 

The matrix is diagonalizable if there is a complete set of eigenvectors, i.e., if there are 
nonzero vectors r 1 , r 2 , . . . , r m e R m such that 



Ar p = X p r p for p = 1,2 , ... ,m. 



(2.70) 
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and these vectors are linearly independent. In this case the matrix 

R = [r l \r 2 \ ■ ■ ■ \r m ], (2.71) 

formed by collecting the vectors r l ,r 2 , , r m together, is nonsingular and has an inverse 
R 1 . We then have 



R~ X AR = A and A = RAR~\ 



(2.72) 



where 



A = 



l 1 

X 2 



= diag(k', X 2 , . . . , X m ). 



X 



m 



Hence we can bring A to diagonal form by a similarity transformation, as displayed in 
(2.72). The importance of this from the standpoint of the PDE is that we can then rewrite 
the linear system (2.69) as 



R~ l q, + R~ l ARR~ l q x = 0. (2.73) 

If we define w(x, 1 ) = R ' q (x . t), then this takes the form 



w, + A w x — 0. (2.74) 

Since A is diagonal, this system decouples into m independent advection equations for the 
components w p of w: 



w't + = 0 for p = 1, 2, . . . , m. (2.75) 

Since each X p is real, these advection equations make sense physically and can be used to 
solve the original system of equations (2.69). Complete details are given in the next chapter, 
but clearly the solution will consist of a linear combination of m “waves” traveling at the 
characteristic speeds X 1 , X 2 , ... , X m . (Recall that eigenvalues are also sometimes called 
“characteristic values.”) These values define the characteristic cuives X(t) = xq + X p t 
along which information propagates in the decoupled advection equations. The functions 
w p (x, t) are called the characteristic variables; see Section 3.2. 

There are some special classes of matrices A for which the system is certainly hyperbolic. 
If A is a symmetric matrix (A = A r ), then A is always diagonalizable with real eigenvalues 
and the system is said to be symmetric hyperbolic. Also, if A has distinct real eigenvalues 
X 1 < X 2 <■■■ < X m , then the eigenvectors must be linearly independent and the system is 
hyperbolic. Such a system is called strictly hyperbolic. The equations of linear acoustics are 
strictly hyperbolic, for example. The homogeneous part of the system (2.29) (i.e., setting 
a — 0) is symmetric hyperbolic but not strictly hyperbolic. Difficulties can arise in studying 
certain nonstrictly hyperbolic equations, as discussed briefly in Section 16.2. If A has real 
eigenvalues but is not diagonalizable, then the system is weakly hyperbolic; see Section 16.3. 
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2.9.1 Second-Order Wave Equations 

From the acoustics equations (2.52) we can eliminate the velocity u and obtain a second- 
order equation for the pressure. Differentiating the pressure equation with respect to t and 
the velocity equation with respect to x and then combining the results gives 

Ptt = -K 0 u x , = -K 0 u, x = K () ( — p x ) = clp xx . 

\Po ) x 

This yields the second-order wave equation of the classical form 

Ptt = clp xx (c 0 = constant). (2.76) 

This is also a hyperbolic equation according to the standard classification of second-order 
differential equations. In this book, however, we concentrate almost entirely on first-order 
hyperbolic systems as defined at the start of Section 2.9. There is a certain equivalence 
as suggested by the above transformation for acoustics. Conversely, given a second-order 
equation of the type (2.76), we can derive a first-order hyperbolic system by defining new 
variables 



q l — p t , q 1 = -p x , 

so that (2.76) becomes q) + c^q 7 : — 0, while the equality of mixed partial derivatives gives 
qf + ql =0. These two equations taken together give a system q, + A qx = 0, with the 
coefficient matrix 



A = 



0 

1 




(2.77) 



This matrix is similar to the matrix A of (2.5 1), meaning that there is a similarity transfor- 
mation A — S A S 1 relating the two matrices. The matrix S relates the two sets of variables 
and leads to a corresponding change in the eigenvector matrix, while the eigenvalues of the 
two matrices are the same, ±co- 

Many books take the viewpoint that the equation (2.76) is the fundamental wave equation 
and a first-order system can be derived from it by introducing “artificial” variables such 
as p, and p x . In fact, however, it is the first-order system that follows directly from the 
physics, as we have seen. Since effective numerical methods are more easily derived for 
the first-order system than for the second-order scalar equation, there is no need for us to 
consider the second-order equation further. 



2.10 Variable-Coefficient Hyperbolic Systems 

A variable-coefficient linear system of PDEs might take the form 

q, + A(x)q x = 0. (2.78) 

This system is hyperbolic at any point x where the coefficient matrix satisfies the conditions 
laid out in Section 2.9. In Section 9.6, for example, we will see that the equations of acoustics 
in a heterogeneous medium (where the density and bulk modulus vary with x) can be written 
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as such a system which is hyperbolic everywhere, with eigenvalues given by ±c(x), where 
the sound speed c(x) varies with position depending on the material parameters. 

In some cases we might have a conservative system of linear equations of the form 

q, + ( A{x)q) x = 0, (2.79) 

in which the flux function f(q, x) — A(x)q depends explicitly on x. This system could be 
rewritten as 



q, + A(x)q x = —A'(x)q (2.80) 

as a system of the form (2.78) with the addition of a source term. Again the problem is 
hyperbolic at any point where A(x) is diagonalizable with real eigenvalues. Such problems 
are discussed further in Chapter 9. 



2.11 Hyperbolicity of Quasilinear and Nonlinear Systems 

A quasilinear system 



q, + A(q,x,t)q x = 0 (2.81) 

is said to be hyperbolic at a point (q, x, t) if the matrix A{q , x , t) satisfies the hyperbolicity 
condition (diagonalizable with real eigenvalues) at this point. 

The nonlinear conservation law (2.11) is hyperbolic if the Jacobian matrix f'(q) appear- 
ing in the quasilinear form (2.41) satisfies the hyperbolicity condition for each physically 
relevant value of q . 

Example 2.1. The nonlinear equations of isentropic gas dynamics (2.38) have the Jacobian 
matrix (2.45). The eigenvalues are 

A 1 = u — c, L 2 — u + c, 

where the velocity u may now vary from point to point, as does the sound speed 

c = sfP\p). ( 2 . 82 ) 

However, since P\p) > 0 at all points in the gas, this nonlinear system is strictly hyperbolic. 
(Provided we stay away from the “vacuum state” where p and p go to zero. For the equation 
of state (2.35), c —> 0 as well in this case, and the nonstrict hyperbolicity at this point causes 
additional difficulties in the nonlinear analysis.) 

Solutions to nonlinear hyperbolic systems also involve wave propagation, and for a 
system of m equations we will often be able to find m independent waves at each point. 
However, since the wave speeds depend on the value of the solution q , wave shapes will typ- 
ically deform, and the solution procedure is greatly complicated by this nonlinear structure. 
Nonlinear conservation laws are discussed starting in Chapter 1 1 . 

In the remainder of this chapter some other hyperbolic systems are introduced. These 
sections can be skipped at this point without loss of continuity. 
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2.12 Solid Mechanics and Elastic Waves 

The equations of linear acoustics were derived in Section 2.8 by linearizing the equations 
of isentropic gas dynamics. Essentially the same system of equations can be derived from 
elasticity theory to obtain the equations modeling a one-dimensional acoustic wave in a 
solid, which again is a small-amplitude compressional disturbance in which the material 
moves back and forth in the same direction as the wave propagates, leading to small-scale 
changes in density and pressure. Unlike a gas or liquid, however, a solid also supports a 
second distinct type of small-amplitude waves called shear waves , in which the motion of 
the material is orthogonal to the direction of wave propagation. These two types of waves 
travel at distinct speeds, as illustrated in Figure 2.2. In general these two types of waves are 
coupled together and the equations of linear elasticity are a single set of hyperbolic equations 
that must be solved for all motions of the solid, which are coupled together. However, if 
we restrict our attention to one-dimensional plane waves, in which all quantities vary only 
in one direction, then these equations decouple into two independent hyperbolic systems 
of two equations each. Mathematically these linear systems are not very interesting, since 
each has the same structure as the acoustics equations we have already studied in detail. 
Because of this, however, some of the basic concepts of wave propagation in solids can be 
most easily introduced in this context, and this foundation will be useful when we develop 
the multidimensional equations. 

Figure 2.2 shows the two distinct types of plane- wave motion in an elastic solid. Other 
types of waves can also be observed in solids, such as surface waves at a free surface or 
interface between different solids, but these have a truly multidimensional structure. The 
strips shown in Figure 2.2 should be viewed as taken from an infinite three-dimensional solid 
in which all quantities vary only with x, so the motion shown extends infinitely far in the 




Fig. 2.2. Illustration of (a) P-waves and (b) S-waves in elastic solids. Time advances going upwards. 
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y (upward) direction and in z (normal to the page). Related one-dimensional equations can 
also be used to study elastic motion in a finite elastic bar, but then additional complications 
arise in that a compression in the x-direction will typically result in some spreading in the 
y- and ^-directions, whereas in the infinite plane-wave case there is no opportunity for such 
spreading; see Chapter 22. 

In Figure 2.2(a) the material is compressed at the left edge by a motion in the x-direction 
confined to a small region in x. This compressional wave moves in the x-direction at some 
speed c p and is analogous to an acoustic wave in a gas. Compressing the material leads to 
an increase in stress and hence to acceleration in the x-direction. These changes in stress 
and velocity are coupled together to result in wave motion. 

Figure 2.2(b) shows a different type of wave, a shear wave in which the material is 
displaced in the y-direction over a small region in x. In a gas or liquid, a shear displacement 
of this type would not result in any restoring force or wave motion. There is no compression 
or expansion of the material, and hence no stress results. Molecules of a gas or liquid may 
be freely rearranged as long as there is no change in the pressure and there will be no 
restoring force. Of course frictional (viscous) forces will arise during a rearrangement as 
the molecules move past one another, but once rearranged they are not pulled back towards 
their original locations. A solid is fundamentally different in that the constituent molecules 
are attached to one another by chemical bonds that resist any deformations. The bonds will 
stretch slightly to allow small elastic deformations, but like tiny springs they exert a restoring 
force that typically grows with the magnitude of any deformation. This operates like pressure 
in the case of compressional waves, but these bonds also resist shear deformations, and the 
restoring forces result in shear waves as illustrated in Figure 2.2(b). These waves move at 
a speed c s that we will see is always smaller then the speed c p of compressional waves. 
The two types of waves are often called P-waves and S-waves, with “P” and “S” having 
two possible interpretations: "pressure” and “shear” waves, or alternatively “primary” and 
“secondary” waves in view of the fact that c p > c s and so the P-wave arising from some 
disturbance always arrives at a distant observer before the S-wave. 

The theory of linear elasticity results from assuming that the deformations are small 
enough that the restoring force is linearly related to an appropriate measure of the defor- 
mation of the solid. For larger deformations the response may be nonlinear. The material 
is still elastic if the deformation is sufficiently small that the material can be expected to 
eventually return to its original configuration if all external forces are removed. If the defor- 
mation is too extreme, however, the material may simply fail (fracture or break), if enough 
bonds are irreparably broken, or it may undergo a plastic deformation, in which bonds are 
broken and reformed in a new configuration so that the resulting solid has a different resting 
configuration from that of the original solid. The theory of plasticity then applies. 



2.12.1 Elastic Deformations 

The mathematical notation of solid mechanics is somewhat different from that of fluid 
dynamics. For an elastic body we are typically concerned with small displacements about 
some reference configuration, the location of the body at rest, for example, and so it makes 
sense to consider the actual location of material points at some time as a function of their 
reference location. For example, in two space dimensions we can let ( A (x , y), Y (x, y)) 
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represent the location at time t of the material whose reference location is (x. y). The 
displacement vector 8 is then defined to be 



< 50 , y, t) 



8 1 (x, y, t ) 




X(x, y, t ) 




X 


S 2 (x, y, t ) 




_ Y(x, y, t ) _ 




_y . 



(2.83) 



The symbol u is often used for the displacement vector, but we reserve this for the velocity 
vector, which is the time derivative of the displacement, 



u O, t) 



u{x, y, t) 




8j{x, y, t) 


v(x, y, t ) 




8 2 (x, y, t) 



(2.84) 



Displacements of the body often lead to strains within the body. A strain is a deformation 
that results in changes of length or shape within the body. These strains in turn lead to 
stress, the interior forces due to the stretching or compression of atomic bonds. These 
forces result in acceleration of the material, affecting the motion and hence the evolution of 
the strains. The equations of elasticity consist of Newton’s law relating force to acceleration 
together with stress-strain relations describing the force that results from a given strain. 
This constitutive relation depends on the particular material (similarly to the equation of 
state for a gas). For sufficiently small strains the stress may be assumed to vary linearly 
with strain, resulting in the equations of linear elasticity. 



2.12.2 Strain 

Not all deformations result in a strain. Rigid-body motions (translations and rotations) in 
which the body is simply moved as a rigid entity do not lead to any internal strain or stress. 
Rigid translations correspond to a displacement vector 8{x, y, t) that varies only with t and 
is independent of spatial position. Clearly there will be a strain in the material only if 5 
varies in space, so that some points are displaced relative to other points in the body. Hence 
the strain depends only on the displacement gradient 



VS = 
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Fa 





(2.85) 



where the subscripts denote partial derivatives. Note that for a rigid translation VS = 0. 

We still need to eliminate solid-body rotations, which can be done by splitting VS into 
the sum of a symmetric and a skew- symmetric matrix, 

V8 = e + Q, (2.86) 



with 



and 



e = -[va + (V5)'] = 



n= -[V8-(V8) J ] = 



8l 



m+^y 



Wy + *l) 



o m-w 

-m-s;) o 



(2.87) 



(2.88) 
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The rotation matrix f2 measures rigid rotations, whereas the symmetric matrix e is the 
desired strain matrix , which will also be written as 



The diagonal elements e 11 and e 22 measure extensional strains in the x- and y-directions, 
whereas e 12 = e 21 is the shear strain. 



Example 2.2. The P-wave shown in Figure 2.2 has a displacement of the form 

5(x, y, t) = 

for some wave form w, and hence 



w(x — Cpt) 

0 



€ — 



w'(x — Cpt) 0 

0 0 



with only e 1 1 nonzero. 

The S-wave shown in Figure 2.2 has a displacement of the form 



S(x , y, t) — 



0 

mix — c s t) 



for some waveform w, and hence 

_ 0 \w'(x — c s t) 

\w'{x — c s t) 0 

with only the shear strain nonzero. 

To study one-dimensional elastic waves of the sort shown in Figure 2.2, we need only 
consider the components e 1 1 and f 12 of the strain and must assume that these are functions 
of ( x , t) alone, independent of y and z.. For two-dimensional elasticity we must consider 
e 22 as well, with all three variables being functions of (x, y, t). For full three-dimensional 
elasticity the displacement vector and strain matrix must be extended to three dimensions. 
The formula (2.87) still holds, and e is now a 3 x 3 symmetric matrix with six independent 
elements, three extensional strains on the diagonal, and three shear strains off the diagonal. 
See Chapter 22 for more discussion of these equations and their proper relation to three- 
dimensional elasticity. 



2.12.3 Stress 

A strain in an elastic body typically results in a restoring force called the stress. For one- 
dimensional elasticity as described above, we need only be concerned with two components 
of the stress: er n (x, t ), the force in the x -direction (the normal stress ), and <r 12 (x, t), the 
force in the y-direction (the shear stress). 
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In one-dimensional linear elasticity there is a complete decoupling of compressional and 
shear effects. The normal stress a 11 depends only on the strain e 11 , while the shear stress 
a 12 depends only on the shear strain e 12 , and these constitutive relations are linear: 

cr 11 = (X + 2/x)e n with X + 2/x > 0, (2.89) 

cr 12 = 2 fie 12 with /x > 0. (2.90) 

Here X and // are the Lame parameters characterizing the material. The parameter /x is also 
called the shear modulus. The parameter X does not have a simple physical meaning, but is 
related to other properties of the material in Section 22. 1 . It is unfortunate that the symbol 
X is standard for this parameter, which should not be confused with an eigenvalue. 



2.12.4 The Equations of Motion 

We are now ready to write down the equations of motion for one-dimensional elastic waves. 
P-waves are governed by the system of equations 



e, 11 - u x = 0, 
pu t — a' 1 = 0, 



(2.91) 



where p > 0 is the density of the material. The first equation follows from the equality 
Xxt = X tx , since 

e n (x, t) = X x (x, t) - 1 =>• e, 11 = X xt , ^ 

u(x, t) — X t (x, t ) =4- u x — X tx . 



The second equation of (2.91) is Newton’s second law since u, is the acceleration. 

The system (2.91) involves both e u and cr 11 , and one of these must be eliminated using 
the constitutive relation (2.89). If we eliminate a 11 , we obtain 




0 -1' 


V 1 ' 


_ —(X + 2 pi)/ p 0 


u 



(2.93) 



This is a hyperbolic system, since the matrix has eigenvalues X — ±c p with 

c p = y/(X + 2 p)/p. (2.94) 



If we instead eliminate f 1 1 we obtain 



■ff 11 ' 
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—{X + 2/x) 
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_ — i/p 


0 


_ u 


X 



(2.95) 



Again the coefficient matrix has eigenvalues ±c p . Note that this form is essentially equiv- 
alent to the acoustic equations derived in Section 2.8 if we identify 



p(x, t) = — a n {x, t ). 



(2.96) 



Since 6 11 measures the extensional stress (positive when the material is stretched, negative 
when compressed), a positive pressure corresponds to a negative value of cr 11 . Note that 
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V 



Fig. 2.3. (a) A typical stress-strain relation a = a(e) for the nonlinear elasticity equation (2.97). 
(b) The equation of state p — p(V ) for isentropic gas dynamics in a Lagrangian frame using the 
p-system (2.108). 



the stress o 1 1 can have either sign, depending on whether the material is compressed or 
stretched, while the pressure in a gas can only be positive. A gas that is “stretched” by 
allowing it to expand to a larger volume will not attempt to contract back to its original 
volume the way a solid will. This is another consequence of the fact that there are no 
intermolecular bonds between the gas molecules. 

One-dimensional nonlinear P -waves can be modeled by the more general form of (2.91) 
given by 




e, - u x = 0, 
pu, - o(e) x = 0, 



(2.97) 



where e is the extensional strain e 1 1 and o 1 1 = n(r ) is a more general nonlinear constitutive 
relation between stress and strain. A typical stress-strain relation might look something like 
what is shown in Figure 2.3(a). In the case shown the derivative of the stress with respect 
to strain decreases as the magnitude of the strain is increased. This is shown for small 
values of |e|, in particular for — e <£ 1, since e = — 1 corresponds to a state of complete 
compression, X x — 0. Elasticity theory typically breaks down long before this. For very 
small deformations e, this nonlinear function can generally be replaced by a linearization 
a = (X + 2p)e, where (A + 2 fi) = ct'( 0). This is the relation (2.90) used in linear elasticity. 

The equations for a linear S-wave are essentially identical to (2.91) but involve the shear 
strain, shear stress, and vertical velocity: 



n 1 

- 2 ^=°’ 
pv, — cr v 12 = 0. 



(2.98) 



The relationship (2.90) is now used to eliminate either e 12 or cr 12 , resulting in a closed 
system of two equations, either 




0 


-1/2' 


V 2- 


.-2 M/P 


0 


. V 



(2.99) 
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if e 12 is used, or 





-p. 

0 



12 



= 0 



( 2 . 100 ) 



if a 12 is used. In either case, the eigenvalues of the coefficient matrix are X — ±c v , with the 
wave speed 



c s = yfdjp- 



( 2 . 101 ) 



In general, p <1 + 2 p and so c s < c p . 

We have assumed that shear-wave motion is in the y-direction. In a three-dimensional 
body one could also observe a plane shear wave propagating in the x -direction for which 
the shear motion is in the "-direction. These are governed by a set of equations identical to 
(2.100) but involving e 13 and ct 13 in place of f 12 and a 12 , and the "-component of velocity 
w in place of v. Shear motion need not be aligned with either the y- or the z-axis, but 
can occur in any direction perpendicular to x. Motion in any other direction is simply a 
linear combination of these two, however, so that there are really two decoupled systems of 
equations for S-waves, along with the system of equations of P-waves, needed to describe all 
plane waves in x . Note that the systems (2.95) and (2. 100) both have the same mathematical 
structure as the acoustics equations studied previously. 

For a general two- or three-dimensional motion of an elastic solid it is not possible 
to decompose the resulting equations into independent sets of equations for P-waves and 
S-waves. Instead one obtains a single coupled hyperbolic system. For motions that are 
fully two-dimensional (but independent of the third direction), one obtains a system of five 
equations for the velocities u, v and the components of the stress tensor a 11 , a 12 , and cr 22 
(or alternatively the three components of the strain tensor). Only in the case of purely one- 
dimensional motions do these equations decouple into independent sets. These decoupled 
systems are related to the full three-dimensional equations in Chapter 22. 



2.13 Lagrangian Gas Dynamics and the /7-System 

The fluid dynamics equations derived in Section 2.6 are in Eulerian form, meaning that x 
represents a fixed location in space, and quantities such as the velocity u(x, t ) refer to the 
velocity of whatever fluid particle happens to be at the point x at time t. Alternatively, the 
equations can be written in Lagrangian form, where fixing the coordinate £ corresponds to 
tracking a particular fluid particle. The Lagrangian velocity U (£, t) then gives the velocity 
of this particle at time t . We must then determine the mapping X(f , t ) that gives the physical 
location of the particle labeled if at time t. This is more like the approach used in elasticity, 
as described in Section 2.12, and in one dimension a system of equations very similar to 
(2.97) results. (The term fluid particle refers to an infinitesimally small volume of fluid, but 
one that still contains a huge number of molecules so that the small-scale random variations 
in velocity can be ignored.) 
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2 Conservation Laws and Differential Equations 



To set up the labeling of points initially, we take an arbitrary physical location xq (say 
xq — 0) and then at each point x assign the label 



$ = 




p(s)ds 



( 2 . 102 ) 



to the particle initially located at x, where p is the initial data for the density. If the density 
is positive everywhere, then this gives a one-one map. Note that f has units of mass and 
the label £ gives the total mass between xq and X(t; , t). Moreover §2 — fi is the total mass 
of all particles between those labeled £ 1 and £2 (at any time t, since particles cannot cross 
in this one-dimensional model). 

The Lagrangian velocity is related to the Eulerian velocity by 



U(f, t) = k(X(§, t), t). 



Since X(i ~ , t) tracks the location of this particle, we must have 

X,(M, t ) = U(M, t). 



We could define a Lagrangian density function similarly, but the conservation of mass 
equation in the Lagrangian framework is more naturally written in terms of the specific 
volume 



v{S,t) = 



1 

p{x(M,t\ ty 



This has units of volume/mass (which is just length/mass in one dimension), so it makes 
sense to integrate this over £. Since integrating the specific volume over a fixed set of 
particles gives the volume occupied by these particles at time t, we must have 

f 2 V(?, f)d% = X(f 2 , t) - XQu t). (2.103) 

Differentiating this with respect to t gives 



d 

dt 







-/* 

hi 



h 9 

— U^,t)dH- 
9? 



(2.104) 



Rearranging this and using the fact that it must hold for all choices of £1 and gives the 
differential form of the conservation law. 



V,-U H = 0. 



(2.105) 



Now consider the conservation of momentum. In Eulerian form pu is the density of 
momentum in units of momentum/volume. In Lagrangian form we instead consider U (f , t ), 
which can be interpreted as the momentum per unit mass, with 
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being the total momentum of all particles between f i and £>■ By conservation of momentum, 
this integral changes only due to flux at the endpoints. Since in the Lagrangian framework 
the endpoints are moving with the fluid, there is no “advective flux” and the only change in 
momentum comes from the pressure difference between the two endpoints, so 

L \ f ' 2 U{^,t)dH = />($!, 0 - p«2.0, 
at 

which leads to the conservation law 



U, + ps= = 0. (2.106) 

If we consider isentropic or isothermal flow, then we have only these two conservation 
laws and the equation of state gives p in terms of V alone. Then (2.105) and (2.106) give 
the system of conservation laws known as the p-system. 



V,-U H = 0, 

u, + piv\ = o. 



(2.107) 



This is another simple system of two equations that is useful in understanding conservation 
laws. It is slightly simpler than the corresponding Eulerian equations (2.38) in that the 
only nonlinearity is in the function p(V). This system if hyperbolic if p'iV ) < 0 (see 
Exercise 2.7). Note that for isentropic flow we have p{V) — kV~ y , corresponding to 
the equation of state (2.35), with the shape shown in Figure 2.3(b). 

Frequently the /7-system is written using lowercase symbols as 



v, — u x = 0, 
u, + p(v) x = 0, 



(2.108) 



and we will generally use this notation when the /7-system is used as a generic example of a 
hyperbolic system. To relate this system to the Eulerian gas dynamics equations, however, 
it is important to use distinct notation as derived above. 

The /7-system (2.108) has a very similar structure to the nonlinear elasticity equation 
(2.97) if we equate p with the negative stress —a as discussed in Section 2.12. Note, 
however, that in the gas dynamics case we must have V > 0 and p > 0, whereas in elasticity 
the stress and strain can each be either positive or negative, corresponding to extension and 
compression respectively (recall Figure 2.3(a)). 



2.14 Electromagnetic Waves 

Electromagnetic waves are governed by Maxwell’s equations. In the simplest case this is 
a hyperbolic system of equations, though in materials where waves are attenuated due to 
induced electric currents these are modified by additional source terms. If we assume there 
is no net electric charge or current in the material through which the wave is propagating, 
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2 Conservation Laws and Differential Equations 



then Maxwell’s equations reduce to 



D, - V x H = 0, 


(2.109) 


B, + V x E = 0, 


(2.110) 


<1 

fa* 

II 

p 


(2.111) 


V • B = 0. 


(2.112) 



Here E. D, B, and H are all vectors with three spatial components. The electric field E and 
the magnetic field B are related to the two other fields D and H via constitutive relations 
that characterize the medium in which the wave is propagating. These are similar to the 
stress-strain relations needed in elasticity theory. In general they take the form 

D = eE, (2.113) 

B = nH , (2.114) 

where e is the permittivity and // is the magnetic permeability of the medium. In a homo- 
geneous isotropic material these are both scalar constants. More generally they could be 
3x3 matrices and also vary in space. 

If the initial data satisfies the divergence-free conditions (2. 1 1 1) and (2. 1 12), then it can 
be shown that these will hold for all time, and so equations (2.109) and (2.1 10) for D, and 
B, can be taken as the time evolution equations for electromagnetic waves. 

If e and pi are scalar constants, then we can eliminate I) and H and rewrite the wave- 
propagation equations as 



1 

E, V x B = 0, 

(2.115) 

B, + V x E = 0. 



This is a linear hyperbolic system of equations in three dimensions. 

In this chapter we consider only the simplest case of a plane wave propagating in the 
x-direction. The B- and Zs -fields then oscillate in the y—z plane, so that electromagnetic 
waves are somewhat like shear waves, with the oscillations orthogonal to the direction of 
propagation. However, there are now two fields, and there is an additional relationship that 
the B -field oscillations for a given wave are orthogonal to the £ -field oscillations. Figure 2.4 
illustrates a wave in which the £-field oscillates in y while the 5-field oscillates in z. In 
this case only the y-component of E and the ^-component of B are nonzero, and both vary 
only with x and t. Then E and B have the form 



" 0 ” 




" 0 " 


E 2 (x, t) 


B = 


0 


0 




_B\x,t)_ 



E = 



(2.116) 
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Fig. 2.4. The E and B fields for an electromagnetic plane wave propagating in the x -direction. 
Maxwell’s equations (2.1 15) then reduce to 

E;+—B 3 x = 0, 

(2.117) 

b; + E 2 X = 0 . 



This has exactly the same structure as the one-dimensional linear acoustics and elasticity 
equations considered previously, with the coefficient matrix 



A = 



0 1/e/i 

1 0 



(2.118) 



The eigenvalues are A. 1,2 = ±c, where 






(2.119) 



is the speed of light in the medium. In a vacuum the parameters e and // take particular 
constant values eo and po known as the permittivity and permeability of free space, and 



co 



1 



( 2 . 120 ) 



is the speed of light in a vacuum. For any other medium we have c < co- 
in a heterogeneous medium that consists of one-dimensional layers of isotropic material, 
e and p would be scalar but vary with x and we would obtain the variable-coefficient 
hyperbolic system 



e(x)Ef(x, t) + 




= 0 , 



Bl(x, t) + El(x, t ) = 0. 



( 2 . 121 ) 
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2 Conservation Laws and Differential Equations 



The methods discussed in Section 9.6 for variable-coefficient acoustics could also be applied 
to this system. 

In some media e and/or p may depend on the strength of the electric or magnetic field, 
and hence will vary as a wave passes through. In this case the constitutive relations become 
nonlinear, and Maxwell’s equations yield a nonlinear hyperbolic system of equations. Ac- 
tually, in most materials e and p. do vary with the field strength, but normally the fields 
associated with electromagnetic waves are so weak that the linearized theory is perfectly 
adequate. However, in some problems with very strong fields or special materials it is nec- 
essary to consider nonlinear effects. The field of nonlinear optics is important, for example, 
in the design and study of fiber-optic cables used to transmit pulses of light over thousands 
of kilometers in the telecommunications industry. 



Exercises 



2.1. Derive the equations (2.48) of linear acoustics from the linearized system (2.47). 

2.2. (a) Show that for smooth solutions the conservation laws (2.38) can be manipulated 

into the following set of nonconservative nonlinear equations for the pressure 
and velocity: 



Pi + up x + pP'(p)u x = 0, 

u, + (l/p)p x + uu x = 0, 



( 2 . 122 ) 



where we assume that the equation of state can be inverted to define p as a function 
of p to complete this system. Note that linearizing this nonlinear system about 
some state (po, Po — P(Po)) again gives the acoustics system (2.47). 

(b) Show that the nonlinear system (2.122) is hyperbolic provided P r (p) > 0, and 
has the same characteristic speeds as the conservative version (2.38). 

2.3. Determine the eigenvalues and eigenvectors the matrix A in (2.77) and also the simi- 
larity transformation relating this to A from (2.51) when uq — 0. 

2.4. Determine the eigenvalues and eigenvectors the matrix A from (2.46), and show that 
these agree with (2.57). Determine the similarity transformation relating this matrix 
to A from (2.51). 

2.5. Determine the condition on the function a(e) that is required in order for the nonlinear 
elasticity equation (2.91) to be hyperbolic. 

2.6. Show that X $ (f , t ) = V(£, t) and hence (2.105) is simply the statement that X ^ — X,g. 

2.7. Show that the p-system (2.108) is hyperbolic provided the function p(V) satisfies 
p'(V) < 0 for all V. 

2.8. Isothermal flow is modeled by the system (2.38) with P(p) = a 2 p, where a is constant; 
see Section 14.6. 

(a) Determine the wave speeds of the linearized equations (2.50) in this case. 

(b) The Lagrangian form of the isothermal equations have p(V) = a 2 / V. Linearize 
the p-system (2.107) in this case about Vo, Uq, and compute the wave speeds 
for Lagrangian acoustics. Verify that these are what you expect in relation to the 
Eulerian acoustic wave speeds. 
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Characteristics and Riemann Problems for Linear 
Hyperbolic Equations 



In this chapter we will further explore the characteristic structure of linear hyperbolic sys- 
tems of equations. In particular, we will study solutions to the Riemann problem , which is 
simply the given equation together with very special initial data consisting of a piecewise 
constant function with a single jump discontinuity. This problem and its solution are dis- 
cussed starting in Section 3.8, after laying some more groundwork. This simple problem 
plays a very important role in understanding the structure of more general solutions. It is 
also a fundamental building block for the finite volume methods discussed in this book. 

Linear hyperbolic systems of the form 

q, + Aq x = 0 (3.1) 

were introduced in the last chapter. Recall that the problem is hyperbolic if A e R mxm is 
diagonalizable with real eigenvalues, so that we can write 

A = RAR~\ (3.2) 

where R is the matrix of right eigenvectors. Then introducing the new variables 

w = R~ x q 



allows us to reduce the system (3.1) to 



w f + A w x — 0, (3.3) 

which is a set of m decoupled advection equations. Note that this assumes A is constant. 
If A varies with x and/or f, then the problem is still linear, but R and A will typically 
depend on x and t as well and the manipulations used to obtain (3.3) are no longer valid. 
See Chapter 9 for discussion of variable-coefficient problems. 



3.1 Solution to the Cauchy Problem 

Consider the Cauchy problem for the constant-coefficient system (3.1), in which we are 
given data 



q(x , 0) = q(x) for — oo < x < oo. 
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From this data we can compute data 



Six ) = R l q(x ) 

for the system (3.3). The p\h equation of (3.3) is the advection equation 

w p + X p w p = 0 (3.4) 



with solution 



w p (x, t ) = w p (x — X p t, 0) = w p (x — X p t). 



Having computed all components w p (x, t) we can combine these into the vector w(x, t), 
and then 



q(x, t) = Rw(x , t) (3.5) 

gives the solution to the original problem. This is exactly the process we used to obtain the 
solution (2.68) to the acoustics equations in the previous chapter. 



3.2 Superposition of Waves and Characteristic Variables 

Note that we can write (3.5) as 



q{x,t) = Y^w P {x,t)r p , (3.6) 

p = i 



so that we can view the vector q(x, t) as being some linear combination of the right eigen- 
vectors r 1 , , r m at each point in space-time, and hence as a superposition of waves 
propagating at different velocities X p . The scalar values w p (x, t) for p — I .... , m give 
the coefficients of these eigenvectors at each point, and hence the strength of each wave. 
The requirements of hyperbolicity insure that these m vectors are linearly independent and 
hence every vector q has a unique representation in this form. The manipulations resulting 
in (3.4) show that the eigencoefficient w p (x) — w p (x, 0) is simply advected at constant 
speed X p as time evolves, i.e., w p (x, t) = w p (x o) all along the curve X{t) — xo+X p t. These 
curves are called characteristics of the pth family, or simply p-characteristics. These are 
straight lines in the case of a constant-coefficient system. Note that for a strictly hyperbolic 
system, m distinct characteristic curves pass through each point in the x-t plane. 

The coefficient w p {x, t) of the eigenvector r p in the eigenvector expansion (3.6) of q(x, t) 
is constant along any /^-characteristic. The functions w p (x, t) are called the characteristic 
variables. 

As an example, for the acoustics equations with A given by (2.5 1), we found in Section 2.8 
that the characteristic variables are — p + Zqii and p + Zqu (or any scalar multiples of these 
functions), where Zq is the impedance; see (2.67). 




3.5 Acoustics 
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3.3 Left Eigenvectors 

Let L = R 1 , and denote the rows of the matrix L by l 1 , £ 2 , . These row vectors 
are the left eigenvectors of the matrix A, 

l p A = X p i p , 



whereas the r p are the right eigenvectors. For example, the left eigenvectors for acoustics 
are given by the rows of the matrix R~ l in (2.66). 

We can write the characteristic variable w p (x,t), which is the pth component of R 1 q(x , t) 
= Lq(x , t), simply as 



w p (x, t ) = l p q(x , t). (3.7) 

We can then rewrite the solution q ix , t) from (3.6) succinctly in terms of the initial data q 
as 

m 

q(x , t) — E U p q(x-X p t)]r p . (3.8) 

p= i 



3.4 Simple Waves 

We can view the solution q(x. t) as being the superposition of m waves, each of which is 
advected independently with no change in shape. The pth wave has shape u> p (x)r p and 
propagates with speed )J’ . This solution has a particularly simple form if w p (x , 0) is constant 
in x for all but one value of p, say w p (x) = w p for p i. Then the solution has the form 

q(x, t) — w'(x — X't)r' + w p r p (3.9) 

— q(x — X't) 

and the initial data simply propagates with speed A.'. Since m — 1 of the characteristic 
variables are constant, the equation essentially reduces to q, + )J q x = 0, which governs 
the behavior of the ;th family. Nonlinear equations have analogous solutions, called simple 
waves, in which variations occur only in one characteristic family; see Section 13.8. 



3.5 Acoustics 

An arbitrary solution to the acoustics equations, as derived in Section 2.8, can be decomposed 
as in (3.6), 



P(x, t) 

u(x, t) 



w 1 (x , t) 




+ w 2 (x, t ) 




(3.10) 



where w l — [—p + Zqu]/2Zq is the strength of the left-going 1-wave, and w 2 = [p + 
Zqu]/2Zo is the strength of the right-going 2-wave. The functions w l (x, t) and w 2 (x, t) 
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satisfy scalar advection equations, 

w } — cowl = 0 and wf + cqw 2 x = 0, 

so from arbitrary initial data we can compute 

w l (x, t) = w 1 (x + cot, 0) = w l (x + Cot), 

W 2 (x, t) — UT(x — Cot, 0) = W 2 {x — Cot), 

where w(x) = R~ l q(x) is the initial data for w, and (3.6) agrees with (2.68). 

The advection equations (3.11) are often called the one-way wave equations, since each 
one models the strength of an acoustic wave going in only one direction. 

If one of the characteristic variables w l or i/r is identically constant, then the solution 
(3.10) is a simple wave as defined in Section 3.4. Suppose, for example, that w l = u> l — 
constant, in which case 



(3.11) 



(3.12) 



q(x, t) = w V 1 + w 2 {x — cot)r 2 . 

In this case it is also easy to check that the full solution q satisfies the one-way wave equation 
q t + c 0 q_ x = 0. 

Simple waves often arise in physical problems. Suppose for example that we take initial 
data in which p — u — 0 everywhere except in some small region near the origin. If we choose 
p and u as arbitrary functions in this region, unrelated to one another, then the solution will 
typically involve a superposition of a left-going and a right-going wave. Figure 3.1 shows 
the time evolution in a case where 

1 9 

p(x, 0) = - exp(— 80x“) + S(x), 

2 (3.13) 

u(x, 0) = 0, 

with 

1 if -0.3 < x < -0.1, 

X 0 otherwise. 

For small time the solution changes in a seemingly haphazard way as the left-going and 
right-going waves superpose. But observe that eventually the two waves separate and for 
larger t their individual forms are easy to distinguish. Once they have separated, each wave 
is a simple wave that propagates at constant velocity with its shape unchanged. In this 
example po — 1 and Kq — 0.25, so that cq — Zq — 1/2. Notice that the left-going wave 
has p — — m/2 while the right-going wave has p — m/2, as expected from the form of the 
eigenvectors. 



3.6 Domain of Dependence and Range of Influence 

Let (X, T) be some fixed point in space-time. We see from (3.8) that the solution q(X , T) 
depends only on the data q at m particular points X — X P T for p — 1.2,..., ni. This set 
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Fig. 3.1. Evolution of an initial pressure perturbation, concentrated near the origin, into distinct 
simple waves propagating with velocities —Cq and c 0 . The left column shows the pressure pertur- 
bation q l = p, and the right column shows the velocity q 2 = u. (Time increases going downwards.) 
[claw/book/chap3/ acousimple] 

of points. 



V(X, T)={X- k p T : p= 1,2,..., m}, (3.14) 

is called the domain of dependence of the point (X, T). See Figure 3.2(a). The value of the 
initial data at other points has no influence on the value of q at (X, T). 

For hyperbolic equations more generally, the domain of dependence is always a bounded 
set, though for nonlinear equations the solution may depend on data over a whole interval 
rather than at only a finite number of distinct points. The bounded domain of dependence 
results from the fact that information propagates at finite speed in a hyperbolic equation, as 
we expect from wave motion or advection. This has important consequences in the design 
of numerical methods, and means that explicit methods can often be efficiently used. 

By contrast, for the heat equation q t — f>q xx , the domain of dependence of any point 
(X, T) is the entire real line. Changing the data anywhere would in principle change the 
value of the solution at (X, T), though the contribution dies away exponentially fast, so 
data at points far away may have little effect. Nonetheless, this means that implicit numer- 
ical methods are often needed in solving parabolic equations. This is discussed further in 
Section 4.4 in relation to the CFL condition. 
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T) xq + X 1 t xq + A 2 t xq + A 3 t 





Fig. 3.2. For a typical hyperbolic system of three equations with X 1 < 0 < X 2 < X 3 , (a) shows the 
domain of dependence of the point (X, T ), and (b) shows the range of influence of the point x 0 . 



Rather than looking at which initial data affects the solution at ( X . T), we can turn things 
around and focus on a single point xq at time t — 0, and ask what influence the data q(x o) 
has on the solution q(x, t). Clearly the choice of data at this point will only affect the 
solution along the characteristic rays xo + L p t for p — 1,2,..., m. This set of points 
is called the range of influence of the point xq. The range of influence is illustrated in 
Figure 3.2(b). 



3.7 Discontinuous Solutions 

While classical solutions of differential equations must be smooth (sufficiently differen- 
tiable) functions, the formula (3.6) can be used even if the initial data q(x) is not smooth, or 
is even discontinuous, at some points. If the data has a singularity (a discontinuity in some 
derivative) at some point xo, then one or more of the characteristic variables vu p (x, 0) will 
also have a singularity at this point. Such singularities in the initial data can then propagate 
along the characteristics and lead to singularities in the solution q(x. t) at some or all of the 
points vo + A p t. 

Conversely, if the initial data is smooth in a neighborhood of all the points x — X p t, then 
the solution q(x, 1 ) must be smooth in a neighborhood of the point (x. t). This means that 
singularities can only propagate along characteristics for a linear system. 



3.8 The Riemann Problem for a Linear System 

The Riemann problem consists of the hyperbolic equation together with special initial data 
that is piecewise constant with a single jump discontinuity. 



q(x) 



ti ifx < 0, 
q r ifx > 0. 



By the remarks in Section 3.7, we expect this discontinuity to propagate along the charac- 
teristic curves. 

For the scalar advection equation q t + uq x — 0, the coefficient “matrix” is the lxl scalar 
value it. The single eigenvalue is X 1 — u, and we can choose the eigenvector to be r 1 = 1. 
The solution to the Riemann problem consists of the discontinuity q r — q/ propagating at 
speed u. along the characteristic, and the solution is q(x, t) — q{x — ut). 




3.8 The Riemann Problem for a Linear System 
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For a general m x m linear system we can solve the Riemann problem explicitly using 
the information we have obtained above. It is very important to understand the structure 
of this solution, since we will see later that Riemann solutions for nonlinear conservation 
laws have a similar structure. Moreover, many of the numerical methods we will discuss 
(beginning in Chapter 4) are based on using solutions to the Riemann problem to construct 
approximate solutions with more general data. 

For the Riemann problem we can simplify the notation if we decompose qi and q, as 



— w 'i r p and <ir — ; 

p = i p = i 

Then the pfh advection equation (3.4) has Riemann data 



op, , 

w (x) — 



w>; ifx < 0 , 
wf lfx > 0 , 



and this discontinuity simply propagates with speed X p , so 

[ wf if x — X p t < 0, 
w p {x,t)=\ ' 

I Wr lfx — X P t > 0. 



(3.15) 



(3.16) 



(3.17) 



If we let P(x. t) be the maximum value of p for which x — X p t > 0, then 

P(x,t ) m 

q(x,t)= ^ wfr p + ^2 wfr p , (3.18) 

p= 1 p=P(.x,t )+ 1 

which we will write more concisely as 

q(x, t) = ^2 Wr>' p + ^2 w j’r p . (3.19) 

p:\r<x/t p:\P>x/t 

The determination of q (x , t) at a given point (X, T) is illustrated in Figure 3.3. In the case 
shown, w l — id* while w 2 — wj and w 3 = wf. The solution at the point illustrated is thus 

q(X, T ) = w] r 1 + wjr 2 + wfr 3 . (3.20) 

Note that the solution is the same at any point in the wedge between the x = X l t and x = X 2 t 
characteristics. As we cross the pfh characteristic, the value of x — X p t passes through 0 
and the corresponding w p jumps from wf to wf. The other coefficients w l (i f- p) remain 
constant. 

The solution is constant in each of the wedges as shown in Figure 3.3. Across the pfh 
characteristic the solution jumps with the jump in q given by 



(wf - wf)r p = a p r p . 



(3.21) 



Note that this jump in q is an eigenvector of the matrix A (being a scalar multiple of r p ). 
This is an extremely important fact, and a generalization of this statement is what will 
allow us to solve the Riemann problem for nonlinear systems of equations. This condition, 
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x = A 1 t x = A 2 t x = A 3 t 




Fig. 3.3. Construction of the solution to the Riemann problem at ( X , T). We trace back along the pth 
characteristic to determine the value of w p from the initial data. The value of q is constant in each 
wedge of the x-t plane: q/ = wjr l + wjr 2 +wfr 3 q* = wlr 1 +wfr 2 +wfr 3 q* = wlr 1 +w 2 r 2 +wfr 3 
q r = w 3 r 1 + w 2 r 2 + w 3 r 3 . Note that the jump across each discontinuity in the solution is an eigenvector 
of A. 

called the Rankine-Hugoniot jump condition, will be derived from the integral form of the 
conservation law and seen to hold across any propagating discontinuity; see Section 11.8. 
Typically the given data ( qi , q r ) will not satisfy this condition, and the process of solving 
the Riemann problem can be viewed as an attempt to split up the jump q r — qi into a series 
of jumps, defining the different waves, each of which does satisfy this condition. 

For the case of a linear system, solving the Riemann problem consists of taking the initial 
data ( qi , q r ) and decomposing the jump q r — qi into eigenvectors of A: 

q r - qi = a\ l +---+a m r m . (3.22) 

This requires solving the linear system of equations 

Ra = q r — qi (3.23) 

for the vector a, and so a — R~ 1 (q r — qi). The vector a has components a p = t p (q r — qi), 
where i p is the left eigenvector defined in Section 3.3, and a p = w'j — wf. Since a p r p is 
the jump in q across the pth wave in the solution to the Riemann problem, we introduce 
the notation 



W p = a p r p 



(3.24) 



for these waves. 

The solution q(x, t) from (3.8) can be written in terms of the waves in two different 
forms: 



q(x,t) = qi + WP (3 ' 25) 

p'XP <xjt 

= c lr - WP - 

p:X p >x/t 



(3.26) 
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This can also be written as 



q(x , t) — qi + H(x — X p t)W p , 

p= 1 



where H(x) is the Heaviside function 



H(x) = 



0 if x < 0, 

1 if x > 0. 



(3.27) 



(3.28) 



3.9 The Phase Plane for Systems of Two Equations 

It is illuminating to view the splitting of q r — q\ in state space, often called the phase plane 
for systems of two equations. This is simply the q l -q 2 plane, where q = (q 1 , q 1 ). Each 
vector qix, t) is represented by a point in this plane. In particular, qi and q r are points in 
this plane, and a discontinuity with left and right states cp and q r can propagate as a single 
discontinuity only if q, — qi is an eigenvector of A, which means that the line segment from 
qi to q r must be parallel to the eigenvector r 1 or r 1 . Figure 3.4 shows an example. For the 
state qi illustrated there, the jump from q\ to q r can propagate as a single discontinuity if 
and only if q r lies on one of the two lines drawn through qi in the directions r 1 and r 2 . These 
lines give the locus of all points that can be connected to q\ by a 1-wave or a 2- wave. This 
set of states is called the Hugoniot locus. We will see that there is a direct generalization of 
this to nonlinear systems in Chapter 13. 

Similarly, there is a Hugoniot locus through any point q r that gives the set of all points 
qi that can be connected to q, by an elementary p- wave. These curves are again in the 
directions r 1 and r 2 . 

For a general Riemann problem with arbitrary cp and q r , the solution consists of two 
discontinuities traveling with speeds l 1 and X 2 , with a new constant state in between that 
we will call q m . By the discussion above. 



q m — + wfr 2 . 



(3.29) 



so that q m — cp = (vo\ — wj)r 1 and q r — q m — (w 2 — w 2 )r 2 . The location of q m in the phase 




Fig. 3.4. (a) The Hugoniot locus of the state qi consists of all states that differ from q\ by a scalar 
multiple of r 1 or r 2 . (b) Solution to the Riemann problem in the x-t plane. 
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(a) 





Fig. 3.5. The new state q,„ arising in the solution to the Riemann problem for two different choices of 
qi and q r . In each case the jump from q / to q m lies in the direction of the eigenvector r 1 corresponding 
to the lower speed, while the jump from q m to q r lies in the direction of the eigenvector r 2 . 



plane must be where the 1 -wave locus through qj intersects the 2-wave locus through q, . 
This is illustrated in Figure 3.5(a). 

Note that if we interchange q r and qi in this picture, the location of q m changes as 
illustrated in Figure 3.5(b). In each case we travel from qi to q r by first going in the 
direction r l and then in the direction r 2 . This is required by the fact that a 1 < a 2 , since 
clearly the jump between qi and q m must travel slower than the jump between q m and q r 
(see Figure 3.4(b)) if we are to obtain a single-valued solution. 

For systems with more than two equations, the same interpretation is possible but becomes 
harder to draw, since the state space is now m -dimensional. Since the m eigenvectors r p 
are linearly independent, we can decompose any jump q r — cp into the sum of jumps in 
these directions via (3.22), obtaining a piecewise linear path from qi to q, in m -dimensional 
space. 



3.9.1 Acoustics 

As a specific example, consider the acoustics equations discussed in Sections 2. 7-2. 8 with 
hq = 0, 
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K q ~ 


" P' 


li 
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Mpo 


0 _ 


u 



= 0 . 



(3.30) 



The eigenvalues and eigenvectors of A are given by (2.54) and (2.58). The phase plane is the 
p-u plane, and the eigenvectors are symmetric about the M-axis as indicated in Figure 3.6(a). 
Solving the general Riemann problem gives a = R~ x {q r — qi ) with components 



a 1 = Z l (q r - qi) — 
a 2 — Z 2 (q r - q{) — 



~(Pr ~ Pi) + Z 0 {u r - u,) 

2Z 0 

(Pr ~ Pi) + Z 0 {u r ~ Ul) 

2Z 0 



(3.31) 
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Fig. 3.6. (a) Eigenvectors for the acoustics equations in the p-u phase plane, where Z 0 is the 
impedance, (b) Solution to a Riemann problem in which u / = u, = 0 and p r < p\. 



and the waves are W 1 = a'r 1 and W 2 = orr 2 . The intermediate state is 



q m = qi +aV 



1 

2 



( Pi + p r ) ~ Z 0 (u r - ui) 
(«/ + u r ) - (p r - p,)/Z Q 



(3.32) 



Example 3.1. Consider a Riemann problem in which ui = u, = 0 and there is only a jump 
in pressure with p, < p /. The phase-plane solution to the Riemann problem is sketched in 
Figure 3.6(b), and we compute that 



1 Pi - Pr 2 Pr - PI 

a — , a = 

2Z 0 

so that the intermediate state is 

q,n =qi+ a'r 1 = q r - a 2 r 2 = ^ 

(Recall that p represents the perturbation of pressure from the constant state p (l , so it is fine 
for it to be negative.) 



Pi + p r 

~{Pr ~ Pi)/Z 0 



3.10 Coupled Acoustics and Advection 

Now consider acoustics in a fluid moving at constant speed u o > 0, and to make the problem 
more interesting suppose that there is also a passive tracer being advected in this fluid, with 
density denoted by <p(x , t). Then we can solve the acoustics and advection equation together 
as a system of three equations, 



p " 




M 0 


Kq 


0 " 
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1/A) 
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M 0 _ 


A. 



Of course the acoustics and advection could be decoupled into two separate problems, but 
it is illuminating to solve the Riemann problem for this full system, since its structure is 
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closely related to what is seen in the nonlinear Euler equations of gas dynamics studied 
later, and is also important in solving two-dimensional acoustics (Section 18.4). 

The coefficient matrix in (3.33) has eigenvalues 



= uq — co, X — Mo, X^ = uq + co, 
and corresponding eigenvectors 



r 1 = 


'-Z 0 " 
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, r 2 = 


1 

0 o 
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"Z 0 ' 
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The solution to the Riemann problem is easily determined: 
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where 

, 1 

a = —~[-{p r - Pi) + z 0 (u r — «/)], 
/Zo 

ct = <p r tpi , 

, 1 

a = —[(Pr - Pi) + z 0 (u r - u,)]. 
/Zo 



(3.34) 



(3.35) 



(3.36) 



Note that the 1-wave and 3-wave are standard acoustic waves independent of <p , while the 
2-wave gives the advection of f. 

Suppose f measures the concentration of a dye in the fluid and that </>,- > </>/, so that 
at time t — 0 the fluid to the left is dark while the fluid to the right is light. Then the 
2-wave marks the interface between the dark and light fluids as time evolves, as indicated 
in Figure 3.7. The two fluids remain in contact across this discontinuity in cp, which has 
no dynamic effect, since this tracer does not affect the fluid dynamics and the pressure and 
velocity are both constant across the 2-wave, This wave is called a contact discontinuity. 

Within each of the two fluids there is an acoustic wave moving at speed co (relative to the 
fluid) away from the origin. The jump in pressure and/or velocity in the original Riemann 
data creates a “noise,” that moves through the fluids at the speed of sound. 




Fig. 3.7. Solution to the Riemann problem for the coupled acoustics and advection problem. The 
interface between dark and light fluid advects at the fluid velocity uq , and acoustic waves move at 
speed Co relative to the fluid. The speed of each wave is indicated, (a) A subsonic case, (b) A supersonic 



case. 
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Figure 3.7 shows two different situations. In Figure 3.7(a) the fluid velocity u o is positive 
but subsonic (uq < c o), and so the left-going acoustic wave (the 1-wave) has a negative 
velocity uq — co < 0 relative to a fixed observer. Figure 3.7(b) illustrates a supersonic flow, 
where uq > co and so uq — cq> 0. In this case all three waves propagate to the right and no 
information can propagate upstream from the observer. This distinction is not very important 
in this linear example. In nonlinear gas dynamics the distinction can be very important. The 
ratio M — |«ol/co is called the Mach number of the flow. 



3.11 Initial-Boundary- Value Problems 

Now consider a hyperbolic system on a bounded interval a < x < b. This is called the 
initial-boundary-value problem, or IBVP for short, since it is a time-dependent problem 
for which we need both initial data and boundary data. For a system of m equations we 
need a total of m boundary conditions. Typically some conditions must be prescribed at 
the left boundary x = a and some at the right boundary x — b. How many are required at 
each boundary depends on the number of eigenvalues of A that are positive and negative, 
respectively. 

We considered the IBVP for the advection equation in Section 2. 1 and saw that we need a 
boundary condition only at x = a if u >0 and only at x — b if u < 0. So if we diagonalize 
a general linear system to obtain a decoupled set of advection equations 

Wf + k p w p = 0 , 

then we need to specify boundary data on w p (x, t) atx = a if \ p >0 and at x — b if X p < 0. 
(For now assume all eigenvalues are nonzero, i.e., that the boundary is noncharacteristic.) 

So if the system of m equations has n < m negative eigenvalues and m — n positive 
eigenvalues, i.e., 

k 1 < X 2 < ■ ■ ■ < r < 0 < V ,+1 < < X m , 



then we need to specify m — n boundary conditions at x = a and n boundary conditions at 
x — b. What sort of boundary data should we impose? Partition the vector w as 



w — 



w 

w 11 



(3.37) 



where w 1 e R" and w 11 e R m_ ". Then at the left boundary x — a, for example, we must 
specify the components of tu 11 , while w 1 are outflow variables. It is valid to specify w 11 in 
terms of w l . For example, we might use a linear boundary condition of the form 

u) n (<7, t) = Biw 1 (a, t) + gi(t), (3.38) 

where B\ e and g\ e If B\ — 0, then we are simply specifying given 

values for the inflow variables. But at a physical boundary there is often some reflection of 
outgoing waves, and this requires a nonzero B\ . 

Boundary conditions should be specified as part of the problem and are determined by the 
physical setup - generally not in terms of the characteristic variables, unfortunately. It is not 
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always easy to see what the correct conditions are to impose on the mathematical equation. 
We may have several pieces of information about what is happening at the boundary. Which 
are the correct ones to specify at the boundary? If we specify too few or too many conditions, 
or inappropriate conditions (such as trying to specify the value of an outflow characteristic 
variable), then the mathematical problem is ill posed and will have no solution, or perhaps 
many solutions. It often helps greatly to know what the characteristic structure is, which 
reveals how many boundary conditions we need and allows us to check that we are imposing 
appropriate conditions for a well-posed problem. In Chapter 7 boundary conditions are 
discussed further, and we will see how to impose such boundary conditions numerically. 

Example 3.2. Consider the acoustics problem (2.50) in a closed tube of gas, a < x < b. 
We expect an acoustic wave hitting either closed end to be reflected. Since the system has 
eigenvalues — co and +co, we need to specify one condition at each end (n — m — n — 1 
and w 1 = w l , in 11 = in 2 ). We do not have any information on values of the pressure at the 
boundary a priori, but we do know that the velocity must be zero at each end at all times, 
since the gas cannot flow through the solid walls (and shouldn’t flow away from the walls 
or a vacuum would appear). This suggests that we should set 

u{a, t ) = u(b , t) — 0 (3.39) 

as our two boundary conditions, and this is correct. Note that we are specifying the same 
thing at each end, although the ingoing characteristic variable is different at the two ends. 
From Section 2.8 we know that the characteristic variables are 

u) 1 = — p + Zqu, w 2 — p+Zqu. (3.40) 

We can combine w 1 and w 2 to see that specifying u = 0 amounts to requiring that w 1 + w 2 — 
0 at each end. At x = a we can write this as 



w 2 (a, t ) = — w 1 (a, t), 

which has the form (3.38) with B\ — 1 and g \ = 0. The outgoing wave is completely reflected 
and feeds back into the incoming wave. Conversely, at x — b we can interpret the boundary 
condition u — 0 as 



w l (b, t ) = —w 2 (b, t), 

which sets the incoming variable at this boundary, again by complete reflection of the 
outgoing variable. 

Example3.3. Suppose we set B\ = 0andgi =0in(3. 38), so that this becomes w n (a, f) = 0. 
Then there is nothing flowing into the domain at the left boundary, and any left-going 
waves will simply leave the domain with no reflection. These are called outflow boundary 
conditions. 

Figure 3.8 shows a continuation of the example shown in Figure 3.1 to later times, with 
a solid wall at the left and outflow boundary conditions imposed at the right, which amount 
to setting w l (b, t) = 0 and hence p(b, t) = Zoii(b, t). 
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Fig. 3.8. Continuation of the example shown in Figure 3.1 with a solid wall at the left and outflow 
boundary conditions imposed at the right. Note that the wave that strikes the left boundary is a 
1-wave with p = —u/2, while the reflected wave is a 2-wave with p = u/2. [claw/book/chap3/ 
acousimple] 

Example 3.4. A set of boundary conditions that is often useful mathematically is the 
periodic boundary conditions 



qia , t) — q(b, t). (3.41) 

This set of boundary conditions couples information at the two boundaries, and the idea is 
that waves going out one end should reenter at the other end. Solving the IB VP with periodic 
boundary conditions is equivalent to solving a Cauchy problem with periodic initial data, 
where the data given in a < x < b is periodically extended to the whole real line. 

We are specifying m coupled boundary conditions rather than m — n at one end and n at 
the other, but we can reinterpret (3.41) in terms of the characteristic variables as 



w u (a, t ) = w ll (b, t ), 
w l {b, t) = w l {a, t). 



(3.42) 
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The m — n incoming values w 11 at x = a are specified using the outgoing values at x = /;, 
while the n incoming values w l at x — b are specified using the outgoing values at X — a. 



Exercises 

3.1. For each of the Riemann problems below, sketch the solution in the phase plane, and 
sketch q 1 (x, t) and q 2 (x, t) as functions of x at some fixed time t: 
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3.2. Write a script in Matlab or other convenient language that, given any 2x2 matrix 
A and states qi and q r , solves the Riemann problem and produces the plots required 
for Exercise 3.1. Test it out on the problems of Exercise 3.1 and others. 

3.3. Solve each of the Riemann problems below. In each case sketch a figure in the x-t 
plane similar to Figure 3.3, indicating the solution in each wedge. 
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3.4. Consider the acoustics equations (3.30) with 



A = 



0 

1/A) 



K 0 

0 



p(x) = 



1 if 1 < x < 2, 
0 otherwise, 



u(x) = 0. 



Find the solution for t > 0. This might model a popping balloon, for example (in 
one dimension). 

3.5. Solve the IB VP for the acoustics equations from Exercise 3.4 on the finite domain 
0 < x < 4 with boundary conditions u( 0, t) = n(4, t) = 0 (solid walls). Sketch the 
solution (u and p as functions of x) at times t — 0, 0.5, 1, 1.5, 2, 3. 




Exercises 



63 



3.6. In the problem of Exercise 3.5, what is the domain of dependence of the point 
X = I , T — 10? In this case the domain of dependence should be defined to include 
not only the set of points x where the initial data affects the solution, but also the set 
of times on each boundary where the boundary conditions can affect the solution at 
the point (X, T). 

3.7. Suppose a tube of gas is bounded by a piston at x — 0 and a solid wall at x — 1, 
and that the piston is very slowly pushed into the tube with constant speed 6 -C c, 
where c is the speed of sound. Then we might expect the gas in the tube to be 
simply compressed slowly with the pressure essentially uniform through the tube 
and increasing in time like p — po + ct K {) , where Kq is the bulk modulus. The 
velocity should be roughly linear in x, varying from u — c at the piston to u — 0 at 
the solid wall. For very small e we can model this using linear acoustics on the fixed 
interval 0 < x < 1 with initial data 

u(x) = 0, p(x) — p 0 , 



and boundary conditions 



u(0,t) = e, u(l,t) = 0. 

The solution consists of a single acoustic wave bouncing back and forth between 
the piston and solid wall (very rapidly relative to the wall motion), with p and u 
piecewise constant. Determine this solution, and show that by appropriately averaging 
this rapidly varying solution one observes the expected behavior described above. 
This illustrates the fact that slow-scale motion is sometimes mediated by high-speed 
waves. 

3.8. Consider a general hyperbolic system q, + Aq x = 0 in which /, = 0 is a simple or 
multiple eigenvalue. How many boundary conditions do we need to impose at each 
boundary in this case? As a specific example consider the system (3.33) in the case 
u o = 0. 




Finite Volume Methods 



In this chapter we begin to study finite volume methods for the solution of conservation 
laws and hyperbolic systems. The fundamental concepts will be introduced, and then we 
will focus on first-order accurate methods for linear equations, in particular the upwind 
method for advection and for hyperbolic systems. This is the linear version of Godunov’s 
method, which is the fundamental starting point for methods for nonlinear conservation laws, 
discussed beginning in Chapter 15. These methods are based on the solution to Riemann 
problems as discussed in the previous chapter for linear systems. 

Finite volume methods are closely related to finite difference methods, and a finite vol- 
ume method can often be interpreted directly as a finite difference approximation to the 
differential equation. However, finite volume methods are derived on the basis of the integral 
form of the conservation law, a starting point that turns out to have many advantages. 



4.1 General Formulation for Conservation Laws 

In one space dimension, a finite volume method is based on subdividing the spatial domain 
into intervals (the "finite volumes,” also called grid cells ) and keeping track of an approx- 
imation to the integral of q over each of these volumes. In each time step we update these 
values using approximations to the flux through the endpoints of the intervals. 

Denote the ith grid cell by 



Ci = (*/_ 1 / 2 , JCf+ 1 / 2 ) » 



as shown in Figure 4.1. The value Q" will approximate the average value over the ith 
interval at time t „ : 



1 f Xi+1/1 1 f 

Q, &— q(x, t n )dx = — — / q(x, t n )dx, 

Jxi- 1/2 A * j Ci 



(4.1) 



where Ax — Xj+ 1/2 — .r,-i /2 is the length ofthe cell. For simplicity we will generally assume 
a uniform grid, but this is not required. (Nonuniform grids are discussed in Section 6.17.) 

If q (x, t) is a smooth function, then the integral in (4.1) agrees with the value of q 
at the midpoint of the interval to 0( A.r 2 ). By working with cell averages, however, it is 
easier to use important properties of the conservation law in deriving numerical methods. In 
particular, we can insure that the numerical method is conservative in a way that mimics the 
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tn + 1 
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n 




Fig. 4.1. Illustration of a finite volume method for updating the cell average Q" by fluxes at the cell 
edges. Shown in x—t space. 



true solution, and this is extremely important in accurately calculating shock waves, as we 
will see in Section 12.9. This is because ffi'Lt Q1 Ax approximates the integral of q over 
the entire interval [ a, b ], and if we use a method that is in conservation form (as described 
below), then this discrete sum will change only due to fluxes at the boundaries x — a and 
x = b. The total mass within the computational domain will be preserved, or at least will 
vary correctly provided the boundary conditions are properly imposed. 

The integral form of the conservation law (2.2) gives 

-j- [ t)dx = f{q(xi_ i /2 , t)) - f(q(x i+ 1 /2 , f)). (4.2) 

ot JCi 



We can use this expression to develop an explicit time-marching algorithm. Given Q ", 
the cell averages at time f„, we want to approximate Q" +l , the cell averages at the next time 
t n+ i after a time step of length At — t n+ \ — t„. Integrating (4.2) in time from t„ to t n+ \ 
yields 





/(<?(•*;— 1 / 2 , 




f(q(x i+ 1 / 2 , t))dt. 



Rearranging this and dividing by Ax gives 



— f q(x, t n +\)dx — — — [ q(x, t„)dx 
Ax J c , Ax J c , 



1 

Ax 



/ tn + 1 (*tn + 1 

f(q(x i+ i/ 2 ,t))dt - J' f(q(xi-i /2 ,t))dt . 



(4.3) 



This tells us exactly how the cell average of q from (4.1) should be updated in one time 
step. In general, however, we cannot evaluate the time integrals on the right-hand side of 
(4.3) exactly, since q(Xj± 1 / 2 , t) varies with time along each edge of the cell, and we don’t 
have the exact solution to work with. But this does suggest that we should study numerical 
methods of the form 
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4 Finite Volume Methods 



where F "_ is some approximation to the average flux along x = x, 

1 f' n+1 

F i-1 l2*Ytj, (4.5) 

If we can approximate this average flux based on the values Q", then we will have a fully 
discrete method. See Figure 4.1 for a schematic of this process. 

For a hyperbolic problem information propagates with finite speed, so it is reasonable 
to first suppose that we can obtain F"_ t j 2 based only on the values Q" , and Q", the cell 
averages on either side of this interface (see Section 4.4 for some discussion of this). Then 
we might use a formula of the form 

FU / 2=HQ1-vQ1) ( 4 - 6 ) 

where T is some numerical flux function. The method (4.4) then becomes 

Q'; +l = Qi - Q'i+i) - H&U Q n i)\ ( 4 - 7 ) 

The specific method obtained depends on how we choose the formula T , but in general 
any method of this type is an explicit method with a three-point stencil , meaning that the 
value Q'l +l will depend on the three values Q "_ , , Q ", and Q" +l at the previous time level. 
Moreover, it is said to be in conservation form, since it mimics the property (4.3) of the 
exact solution. Note that if we sum A xQ" +l from (4.4) over any set of cells, we obtain 



At 



Ax^]er 1 = Ax^e' 1 -— (f 



2+1/2 ' 



F /— 1/2)- 



(4.8) 



The sum of the flux differences cancels out except for the fluxes at the extreme edges. Over 
the full domain we have exact conservation except for fluxes at the boundaries. (Numerical 
boundary conditions are discussed later.) 

The method (4.7) can be viewed as a direct finite difference approximation to the con- 
servation law q, + f{q) x = 0, since rearranging it gives 



n ”+ 1 r>n F n — F n 

O; “ Qi + 0+1/2 O-1/2 _ Q 



At 



Ax 



(4.9) 



Many methods can be equally well viewed as finite difference approximations to this equa- 
tion or as finite volume methods. 



4.2 A Numerical Flux for the Diffusion Equation 

The above derivation was presented for a conservation law in which the flux f(q ) depends 
only on the state q. The same derivation works more generally, however, for example if the 
flux depends explicitly on x or if it depends on derivatives of the solution such as q x . As an 
example consider the diffusion equation (2.22), where the flux (2.20) is 



f(q x ,x) = -fflx)q x . 
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Given two cell averages <2;-i and Qi-, the numerical flux T{ Q,-\ . Q,) at the cell interface 
between can very naturally be defined as 

HQi- 1, Qi) = -A -i/2 ( Qi — Y (4.10) 

where /S,_i /2 ^ f(Xi-i/ 2 )- This numerical flux has the natural physical interpretation that 
the conserved quantity measured by q flows from one grid cell to its neighbor at a rate 
proportional to the difference in 0-values in the two cells, with — 1/2 measuring the 
conductivity of the interface between these cells. This is a macroscopic version of Fick’s 
law or Fourier’s law (or Newton’s law of cooling). 

Using (4.10) in (4.7) gives a standard finite difference discretization of the diffusion 
equation, 

Q" +l = Q" + ^[A +1 /2(G? + i - Q") ~ ft-i/2(G7 - QU)l (4-H) 

If /3 = constant, then this takes the simpler form 

Q'U = Q! + -^iPiQU - 2 Qi + Qi+ 1 ) (4.1.2) 

and we recognize the centered approximation to q xx . 

For parabolic equations, explicit methods of this type are generally not used, since they are 
only stable if At = 0( Ax 2 ). Instead an implicit method is preferable, such as the standard 
Crank-Nicolson method, 

Q’! +l = Qi + ^[/W(e? +1 - Qi) ~ Pi-idQ? - QU) 

+ Pi+wiQXl - Q" +1 ) - A-i/2(0' ,+1 - Qtl)l (4-13) 

This can also be viewed as a finite volume method, with the flux 

K -t/2 = - QU) + A-i/ 2 (0' !+1 - Qtl)\ 

This is a natural approximation to the time-averaged flux (4.5), and in fact has the advantage 
of being a second-order accurate approximation (since it is centered in both space and time) 
as well as giving an unconditionally stable method. 

The stability difficulty with explicit methods for the diffusion equation arises from the fact 
that the flux (4. 10) contains Ax in the denominator, leading to stability restrictions involving 
At/(Ax ) 2 after multiplying by At / Ax in (4.4). For first-order hyperbolic equations the 
flux function involves only q and not q x , and explicit methods are generally more efficient. 
However, some care must be taken to obtain stable methods in the hyperbolic case as well. 



4.3 Necessary Components for Convergence 

Later in this chapter we will introduce various ways to define the numerical flux function 
of (4.6) for hyperbolic equations, leading to various different finite volume methods. There 
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are several considerations that go into judging how good a particular flux function is for 
numerical computation. One essential requirement is that the resulting method should be 
convergent , i.e.. the numerical solution should converge to the true solution of the differential 
equation as the grid is refined (as Ax, At -> 0). This generally requires two conditions: 

• The method must be consistent with the differential equation, meaning that it approxi- 
mates it well locally. 

• The method must be stable in some appropriate sense, meaning that the small errors made 
in each time step do not grow too fast in later time steps. 

Stability and convergence theory are discussed in more detail in Chapter 8. At this stage we 
simply introduce some essential ideas that are useful in discussing the basic methods. 



4.3.1 Consistency 

The numerical flux should approximate the integral in (4.5). In particular, if the function 
q{x, t) = q is constant inx, then q will not change with time and the integral in (4.5) simply 
reduces to f(q). As a result, if Q"_ 1 — Q" — q, then we expect the numerical flux function 
T of (4.6) to reduce to f(q), so we require 

Hq,q)=f(q) (4.14) 

for any value q. This is part of the basic consistency condition. We generally also ex- 
pect continuity in this function as Qt - i and Q, vary, so that T( Q, - 1 ■ Q, ) — » fiq) as 
Qi- 1, Qi — > q. Typically some requirement of Lipschitz continuity is made, e.g., there 
exists a constant L so that 



\HQi-u Qi) - f(q)\ < L max(| Q, - q\, \Q i _ l - q |). (4.15) 



4.4 The CFL Condition 

Stability analysis is considered in detail in Chapter 8. Here we mention only the CFL 
condition, which is a necessary condition that must be satisfied by any finite volume or 
finite difference method if we expect it to be stable and converge to the solution of the 
differential equation as the grid is refined. It simply states that the method must be used in 
such a way that information has a chance to propagate at the correct physical speeds, as 
determined by the eigenvalues of the flux Jacobian f'{q). 

With the explicit method (4.7) the value <2" +1 depends only on three values <2''_p Q ", 
and Q n i+l at the previous time step. Suppose we apply such a method to the advection 
equation q, + uq x — 0 with u > 0 so that the exact solution simply translates at speed m 
and propagates a distance u At over one time step. Figure 4.2(a) shows a situation where 
u At < Ax, so that information propagates less than one grid cell in a single time step. 
In this case it makes sense to define the flux at x,_i /2 in terms of Q"_ l and Q" alone. In 
Figure 4.2(b), on the other hand, a larger time step is used with u At > Ax. In this case the 
true flux at x,_i /2 clearly depends on the value of Q"_ 2 , and so should the new cell average 
Q" +1 . The method (4.7) would certainly be unstable when applied with such a large time 
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Fig. 4.2. Characteristics for the advection equation, showing the information that flows into cell C, 
during a single time step, (a) For a small enough time step, the flux at X;_i /2 depends only on the 
values in the neighboring cells - only on <2"_i in this case where u > 0. (b) For a larger time step, 
the flux should depend on values farther away. 

step, no matter how the flux (4.6) was specified, if this numerical flux depended only on 
Q'U and Q". 

This is a consequence of the CFL condition, named after Courant, Friedrichs, and Lewy. 
They wrote one of the first papers on finite difference methods for partial differential 
equations [93] in 1928. (There is an English translation in [94].) They used finite difference 
methods as an analytic tool for proving the existence of solutions of certain PDEs. The 
idea is to define a sequence of approximate solutions (via finite difference equations), prove 
that they converge as the grid is refined, and then show that the limit function must satisfy 
the PDE, giving the existence of a solution. In the course of proving convergence of this 
sequence (which is precisely what we are interested in numerically), they recognized the 
following necessary stability condition for any numerical method: 

CFL Condition: A numerical method can be convergent only if its numerical domain 
of dependence contains the true domain of dependence of the PDE, at least in the limit 
as At and Ax go to zero. 

It is very important to note that the CFL condition is only a necessary condition for 
stability (and hence convergence). It is not always sufficient to guarantee stability. In the 
next section we will see a numerical flux function yielding a method that is unstable even 
when the CFL condition is satisfied. 

The domain of dependence D(X . T) for a PDE has been defined in Section 3.6. The 
numerical domain of dependence of a method can be defined in a similar manner as the 
set of points where the initial data can possibly affect the numerical solution at the point 
( X , T). This is easiest to illustrate for a finite difference method where pointwise values 
of Q are used, as shown in Figure 4.3 for a three-point method. In Figure 4.3(a) we see 
that Qj depends on Qj_ v Q\, Q] +l and hence on . . . , Q®, 2 . Only initial data in the 
interval X — 2 Ax a < x < X + 2 Ax a can affect the numerical solution at (X, T) — (x, , tf). 
If we now refine the grid by a factor of 2 in both space and time ( Ax b — Ax a /2), but 
continue to focus on the same physical point (X, T), then we see in Figure 4.3(b) that 
the numerical approximation at this point now depends on initial data at more points in 
the interval X — 4 Ax h < x < X + 4 Ax h . But this is the same interval as before. If we 
continue to refine the grid with the ratio At / Ax = r fixed, then the numerical domain of 
dependence of a general point (X, T) is X — T / r < x < X + T jr. 





70 



4 Finite Volume Methods 




Fig. 4.3. (a) Numerical domain of dependence of a grid point when using a three-point explicit finite 
difference method, with mesh spacing Ax a . (b) On a finer grid with mesh spacing Ax b — \ Ax a . 
Similar figures can be drawn for finite volume methods. 



In order for the CFL condition to be satisfied, the domain of dependence of the true 
solution must lie within this interval. For the advection equation q, + uq x = 0, for example, 
V(X, T) is the single point X — uT, since q(X , T) = q(X — uT). The CFL condition then 
requires 

X - T/r < X - uT < X + T/r 

and hence 

u At 

v = < I. (4.16) 

Ax 

If this condition is not satisfied, then a change in the initial data q at X — UT would change 
the true solution at (X, T ) but could have no effect on the numerical solution at this point. 
Clearly the method cannot converge to the proper solution for all choices of initial data 
under these circumstances. 

The ratio v in (4.16) is sometimes called the CFL number, or more frequently the Courant 
number. Returning to the finite volume method illustrated in Figure 4.2, note that the Courant 
number measures the fraction of a grid cell that information propagates through in one time 
step. For a hyperbolic system of equations there are generally a set of m wave speeds 
A 1 , . . . , X m as described in Chapter 3, and the true domain of dependence is given by (3.14). 
In this case we define the Courant number by 

v — max |A/|. (4.17) 

Ax p 

For a three-point method the CFL condition again leads to a necessary condition v < 1. 

Note that if the method has a wider stencil, then the CFL condition will lead to a more 
lenient condition on the time step. For a centered five-point stencil in which Q" +i depends 
also on Q1_ 2 and Q" +2 , the CFL condition gives v <2. Again this will only be a necessary 
condition, and a more detailed analysis of stability would be required to determine the actual 
stability constraint needed to guarantee convergence. 

For hyperbolic equations we typically use explicit methods and grids for which the 
Courant number is somewhat smaller than 1. This allows keeping At / Ax fixed as the grid 
is refined, which is sensible in that generally we wish to add more resolution at the same 
rate in both space and in time in order to improve the solution. 
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For a parabolic equation such as the diffusion equation, on the other hand, the CFL 
condition places more severe constraints on an explicit method. The domain of dependence 
of any point (X, T) for T >0 is now the whole real line, T)( X . T ) = (— oo, oo), and data at 
every point can affect the solution everywhere. Because of this infinite propagation speed, 
the CFL condition requires that the numerical domain of dependence must include the 
whole real line, at least in the limit as At, Ax -* 0. For an explicit method this can be 
accomplished by letting At approach zero more rapidly than Ax as we refine the grid, e.g., 
by taking At = 0( Ax 2 ) as required for the method (4.1 1). A better way to satisfy the CFL 
condition in this case is to use an implicit method. In this case the numerical domain of 
dependence is the entire domain, since all grid points are coupled together. 



4.5 An Unstable Flux 

We now return to the general finite volume method (4.4) for a hyperbolic system and consider 
various ways in which the numerical flux might be defined. In particular we consider flux 
functions T as in (4.6). We wish to define the average flux at x,_i /2 based on the data Q'}_ j 
and Q" to the left and right of this point. A first attempt might be the simple arithmetic 
average 

FJU/2 = HQl i- Q" ) = 0 + f(Q")l (4-i8) 

Using this in (4.4) would give 

G" +1 = Q1 - ^[/(07+t) - f{QU)\ (4-19) 

Unfortunately, this method is generally unstable for hyperbolic problems and cannot be used, 
even if the time step is small enough that the CFL condition is satisfied. (See Exercise 8.1.) 



4.6 The Lax-Friedrichs Method 

The classical Lax-Friedrichs (LxF) method has the form 

or 1 = |(er_i + Q'i+i) - 2^[/(e?+ 0 - ^ 

This is very similar to the unstable method (4.19), but the value Q'j is replaced by the 
average \ ( Q "_ , + Q" +1 ). For a linear hyperbolic equation this method is stable provided 
v < 1, where the Courant number v is defined in (4.17). 

At first glance the method (4.20) does not appear to be of the form (4.4). Flowever, it can 
be put into this form by defining the numerical flux as 

1 A x 

HQ1- 1. Q") = ^[f{QU) + f{Q")\ - - Q1- 1). (4.2D 

Note that this flux looks like the unstable centered flux (4.18) with the addition of another 
term similar to the flux (4.10) of the diffusion equation. By using this flux we appear to be 
modeling the advection-diffusion equation q, + f(q) x = fiq xx with /) = \( Ax) 2 / A t. But 
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if we fix At/ Ax, then we see that this coefficient vanishes as the grid is refined, so in the 
limit the method is still consistent with the original hyperbolic equation. This additional 
term can be interpreted as numerical diffusion that damps the instabilities arising in (4.19) 
and gives a method that can be shown to be stable for Courant number up to 1 (which is also 
the CFL limit for this three-point method). However, the Lax-Friedrichs method introduces 
much more diffusion than is actually required, and gives numerical results that are typically 
badly smeared unless a very fine grid is used. 



4.7 The Richtmyer Two-Step Lax-Wendroff Method 

The Lax-Friedrichs method is only first-order accurate. Second-order accuracy can be 
achieved by using a better approximation to the integral in (4.5). One approach is to first 
approximate q at the midpoint in time, t n+ 1/2 = t n +\ At, and evaluate the flux at this point. 
The Richtmyer method is of this form with 

FUn = (4.22) 

where 

Qtljl = \{Q1- 1 + Q!) ~ 2^[/(C?) - f{QU)\ (4-23) 

Note that <2 "+i/ 2 ' s obtained by applying the Lax-Friedrichs method at the cell interface 
with Ax and At replaced by \ Ax and \ At respectively. 

For a linear system of equations, f(q) — Aq, the Richtmyer method reduces to the stan- 
dard Lax-Wendroff method, discussed further in Section 6. 1. As we will see, these methods 
often lead to spurious oscillations in solutions, particularly when solving problems with dis- 
continuous solutions. Additional numerical diffusion (or artificial viscosity) can be added 
to eliminate these oscillations, as first proposed by von Neumann and Richtmyer [477], In 
Chapter 6 we will study a different approach to obtaining better accuracy that allows us to 
avoid these oscillations more effectively. 



4.8 Upwind Methods 

The methods considered above have all been centered methods, symmetric about the point 
where we are updating the solution. For hyperbolic problems, however, we expect infor- 
mation to propagate as waves moving along characteristics. For a system of equations we 
have several waves propagating at different speeds and perhaps in different directions. It 
makes sense to try to use our knowledge of the structure of the solution to determine better 
numerical flux functions. This idea gives rise to upwind methods in which the information 
for each characteristic variable is obtained by looking in the direction from which this 
information should be coming. 

For the scalar advection equation there is only one speed, which is either positive or neg- 
ative, and so an upwind method is typically also a one-sided method, with Q" + 1 determined 
based on values only to the left or only to the right. This is discussed in the next section. 
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For a system of equations there may be waves traveling in both directions, so an up- 
wind method must still use information from both sides, but typically uses characteristic 
decomposition (often via the solution of Riemann problems) to select which information to 
use from each side. Upwind methods for systems of equations are discussed beginning in 
Section 4.10. 



4.9 The Upwind Method for Advection 

For the constant-coefficient advection equation q, + uq x — 0, Figure 4.2(a) indicates that 
the flux through the left edge of the cell is entirely determined by the value Q"_ l in the cell 
to the left of this cell. This suggests defining the numerical flux as 



F’U, 2 = uQU- 

This leads to the standard first-order upwind method for the advection equation, 



e; ,+1 = q; 



u At 
Ax 



(Q" - Ql i). 



(4.24) 



(4.25) 



Note that this can be rewritten as 

yi+l 



er-g? , 8 f g? T 6?-r l=0> 



At 



Ax 



whereas the unstable centered method (4.19) applied to the advection equation is 

G7 +I -e? , -(Qi +l -Q'U 



At 



2 A.r 



= 0. 



The upwind method uses a one-sided approximation to the derivative q x in place of the 
centered approximation. 

Another interpretation of the upwind method is suggested by Figure 4.4(a). If we think 
of the Q" as being values at grid points, Q" ss q (x , , t n ), as is standard in a finite difference 
method, then since q(x,t) is constant along characteristics we expect 



Q" +l q(Xi, t„+ 1 ) = q(Xi - u At, t n ). 




Fig. 4.4. Two interpretations of the upwind method for advection. (a) If Q" represents the value at 
a grid point, then we can trace the characteristic back and interpolate, (b) If Q" represents the cell 
average, then the flux at the interface is determined by the cell value on the upwind side. 
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If we approximate the value on the right by a linear interpolation between the grid values 
Q"_i and Q", we obtain the method 



„ , , u At „ 

Q = Q 

' Ax 



u At 
Ax 



Qi- 



(4.26) 



This is simply the upwind method, since a rearrangement gives (4.25). 
Note that we must have 



0 < 



u At 

< 1 

Ax 



(4.27) 



in order for the characteristic to fall between the neighboring points so that this interpolation 
is sensible. In fact, (4.27) must be satisfied in order for the upwind method to be stable, and 
also follows from the CFL condition. Note that if (4.27) is satisfied then (4.26) expresses 
Q1 +1 as a convex combination of Q’f and <2"_i (i.e., the weights are both nonnegative and 
sum to 1). This is a key fact in proving stability of the method. (See Section 8.3.4.) 

We are primarily interested in finite volume methods, and so other interpretations of the 
upwind method will be more valuable. Figure 4.4(b) and Figure 4.5 show the finite volume 
viewpoint, in which the value Q” is now seen as a cell average of q over the i th grid cell C; . 
We think of mixing up the tracer within this cell so that it has this average value at every 
point in the cell, at time t „ . This defines a piecewise constant function at time t n with the 





(a) (b) 

Fig. 4.5. Wave-propagation interpretation of the upwind method for advection. The bottom pair of 
graphs shows data at time t „ , represented as a piecewise constant function. Over time At this function 
shifts by a distance u At as indicated in the middle pair of graphs. We view the discontinuity that 
originates at A',_i /2 as a wave W,_i/ 2 . The top pair shows the piecewise constant function at the end of 
the time step after advecting. The new cell averages Q" +1 in each cell are then computed by averaging 
this function over each cell, (a) shows a case with u > 0, while (b) shows u < 0. 
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value Q'l in cell C,- . As time evolves, this piecewise constant function advects to the right 
with velocity u, and the jump between states Q"_ l and Q" shifts a distance u At into cell 
C,. At the end of the time step we can compute a new cell average Q" + 1 in order to repeat 
this process. To compute Q" + 1 we must average the piecewise constant function shown 
in the top of Figure 4.5 over the cell. Computing this average results in the same convex 
combination (4.26) as was motivated by the characteristic-based approach of Figure 4.4(a), 
as the reader should verify. 

We can also take a wave-propagation viewpoint, which will prove useful in extending 
and implementing the upwind method. The jump VV,- — i /2 = Q'l — Q'J_ \ can be viewed as a 
wave that is moving into cell C; at velocity u. This wave modifies the value of q by — W;_ 1/2 
at each point it passes. Over the time step it moves a distance ii At and passes through a 
fraction u At / Ax of the grid cell, and hence the cell average is modified by this fraction of 
-Wi- 1 / 2 : 



Q'l +l = Q’l + ^(-VlU/ 2 ). (4.28) 

This again results in the upwind method (4.25). 

In the above discussion we have assumed that it > 0. On the other hand if u < 0 then 
the upwind direction is to the right and so the numerical flux at jc, — 1/2 is 



7 - 1/2 



= uQl. 



(4.29) 



The upwind method then has the form 

Q'! +l = Ql - ^(Qi+1 - Q'l)- ( 4 -30) 

This can also be written in wave-propagation form as 

, 1 u At 

Q'l = Q’l - — W,+i/2, (4.31) 

with W’,+ 1/2 = Q'l + 1 — Q'l ■ All the interpretations presented above carry over to this case 
u < 0, with the direction of flow reversed. The method (4.31) is stable provided that 



11 At 

-1 < 

Ax 



< 0. 



(4.32) 



The two formulas (4.24) and (4.29) can be combined into a single upwind formula that 
is valid for ii of either sign. 



Fl U2 = u-Ql + u + Ql_ v 



(4.33) 



where 



« + = ma x(m, 0), 



u = min(n, 0). 



(4.34) 
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The wave-propagation versions of the upwind method in (4.28) and (4.31) can also be 
combined to give the more general formula 

Qi +1 = Qi ~ ^ (m + W/— 1/2 + u~W i+ 1 / 2 ). (4.35) 

This formulation will be useful in extending this method to more general hyperbolic 
problems. Not all hyperbolic equations are in conservation form; consider for example the 
variable-coefficient linear equation (2.78) or the quasilinear system (2.81) with suitable 
coefficient matrix. Such equations do not have a flux function, and so numerical methods 
of the form (4.7) cannot be applied. However, these hyperbolic problems can still be solved 
using finite volume methods that result from a simple generalization of the high-resolution 
methods developed for hyperbolic conservation laws. The unifying feature of all hyperbolic 
equations is that they model waves that travel at finite speeds. In particular, the solution to a 
Riemann problem with piecewise constant initial data (as discussed in Chapter 3) consists 
of waves traveling at constant speeds away from the location of the jump discontinuity in 
the initial data. 

In Section 4.12 we will generalize (4.35) to obtain an approach to solving hyperbolic 
systems that is more general than the flux-differencing form (4.4). First, however, we see 
how the upwind method can be extended to systems of equations. 



4.10 Godunov’s Method for Linear Systems 

The upwind method for the advection equation can be derived as a special case of the 
following approach, which can also be applied to systems of equations. This will be referred 
to as the REA algorithm , for reconstruct-evolve-average. These are one-word summaries 
of the three steps involved. 

Algorithm 4.1 (REA). 

1. Reconstruct a piecewise polynomial function q"(x, t n ) defined for all x, from the cell 
averages Q'f In the simplest case this is a piecewise constant function that takes the 
value Q" in the ith grid cell, i. e . , 

q'\x, t n ) = Q'j for all x € C, . 

2. Evolve the hyperbolic equation exactly ( or approximately ) with this initial data to obtain 
q"(x, t, i+ i) a time At later. 

3. Average this function over each grid cell to obtain new cell averages 

Q" +l = -p- ( q n (x,t n+ i)dx. 

Ax J c . 

This whole process is then repeated in the next time step. 

In order to implement this procedure, we must be able to solve the hyperbolic equation 
in step 2. Because we are starting with piecewise constant data, this can be done using the 
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theory of Riemann problems as introduced for linear problems in Chapter 3. When applied 
to the advection equation, this leads to the upwind algorithm, as illustrated in Figure 4.5. 

The general approach of Algorithm 4.1 was originally proposed by Godunov [157] as 
a method for solving the nonlinear Euler equations of gas dynamics. Application in that 
context hinges on the fact that, even for this nonlinear system, the Riemann problem with 
piecewise constant initial data can be solved and the solution consists of a finite set of 
waves traveling at constant speeds, as we will see in Chapter 13. 

Godunov’s method for gas dynamics revolutionized the field of computational fluid dy- 
namics, by overcoming many of the difficulties that had plagued earlier numerical methods 
for compressible flow. Using the wave structure determined by the Riemann solution allows 
shock waves to be handled in a properly “upwinded” manner even for systems of equations 
where information propagates in both directions. We will explore this for linear systems in 
the remainder of this chapter. 

In step 1 we reconstruct a function q n ix , t„ ) from the discrete cell averages. In Godunov’s 
original approach this reconstruction is a simple piecewise constant function, and for now we 
concentrate on this form of reconstruction. This leads most naturally to Riemann problems, 
but gives only a first-order accurate method, as we will see. To obtain better accuracy one 
might consider using a better reconstruction, for example a piecewise linear function that 
is allowed to have a nonzero slope cr" in the /th grid cell. This idea forms the basis for the 
high-resolution methods that are considered starting in Chapter 6. 

Clearly the exact solution at time t n+ \ can be constructed by piecing together the Riemann 
solutions, provided that the time step At is short enough that the waves from two adjacent 
Riemann problems have not yet started to interact. Figure 4.6 shows a schematic diagram 
of this process for the equations of linear acoustics with constant sound speed c, in which 
case this requires that 

1 

c At < - Ax, 

~ 2 

so that each wave goes at most halfway through the grid cell. Rearranging gives 



c At 1 

< -. 

A.r 2 



(4.36) 




Fig. 4.6. An illustration of the process of Algorithm 4. 1 for the case of linear acoustics. The Riemann 
problem is solved at each cell interface, and the wave structure is used to determine the exact solution 
time At later. This solution is averaged over the grid cell to determine Q" +l . 
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The quantity c At /Ax is simply the Courant number, so it appears that we are limited in 
(4.36) to a Courant number less than 1 /2. But we will see below that this method is easily 
extended to Courant numbers up to 1 . 



4.11 The Numerical Flux Function for Godunov’s Method 

We now develop a finite volume method based on Algorithm 4. 1 that can be easily imple- 
mented in practice. As presented, the algorithm seems cumbersome to implement. The exact 
solution q n (x, t n+ \) will typically contain several discontinuities and we must compute its 
integral over each grid cell in order to determine the new cell averages Q " 1 1 . However, 
it turns out to be easy to determine the numerical flux function T that corresponds to 
Godunov’s method. 

Recall the formula (4.5), which states that the numerical flux F' l _ y2 should approximate 
the time average of the flux at x,-_ 1/2 over the time step, 

Pi-i/2 ^ J f(q(Xi- U t))dt. 



In general the function q(x,- 1 / 2 , t) varies with t, and we certainly don't know this variation 
of the exact solution. However, we can compute this integral exactly if we replace q(x, t) 
by the function q"{x,1) defined in Algorithm 4.1 using Godunov’s piecewise constant 
reconstruction. The structure of this function is shown in Figure 3.3, for example, and so 
clearly q'\x ,_i/ 2 , t) is constant over the time interval t n < t < t n+ \. The Riemann problem 
centered at x,_i /2 has a similarity solution that is constant along rays (x — x,_i/ 2 )/(t — t n ) — 
constant, and looking at the value along (x — x,_i /2 )/t — 0 gives the value of g"(x;_ 1 / 2 , t). 
Denote this value by Qf_ l//0 = q v ( Q "_ , , Q"). This suggests defining the numerical flux 

F l 1/2 ^ 



T7 n 

M — 1/2 



f(q\QU'Q n i))- 



(4.37) 



This gives a simple way to implement Godunov’s method for a general system of conser- 
vation laws: 

• Solve the Riemann problem at Xi-i /2 to obtain q^iQ"^, Q "). 

• Define the flux F"_ yl = Q ■) by (4.37). 

• Apply the flux-differencing formula (4.4). 

Godunov’s method is often presented in this form. 



4.12 The Wave-Propagation Form of Godunov’s Method 

By taking a slightly different viewpoint, we can also develop simple formulas for Godunov’s 
method on linear systems of equations that are analogous to the form (4.35) for the upwind 
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A 2 At 




Fig. 4.7. An illustration of the process of Algorithm 4.1 for the case of a linear system of three 
equations. The Riemann problem is solved at each cell interface, and the wave structure is used to 
determine the exact solution time At later. The wave W, 2 _| , 2 , for example, has moved a distance A 2 At 
into the cell. 



method on the advection equation. This viewpoint is particularly useful in extending 
Godunov’s method to hyperbolic systems that are not in conservation form. 

Figure 4.7 shows a more complicated version of Figure 4.6, in which a linear system of 
three equations is solved assuming A 1 < 0 < A 2 < A 3 . The function q n (x , t„+ \ ) will typically 
have three discontinuities in the grid cell C;, at the points x,-_ 1/2 + A 2 At, X;_ 1/2 + A 3 At, 
and Xi+ 1/2 + A 1 At. 

Instead of trying to work with this function directly to compute the new cell average, 
recall from Section 3.8 that for a linear system the solution to the Riemann problem can be 
expressed as a set of waves. 



Qi Qi-i — YafLmrP — ^ 

p = 1 p = 1 



(4.38) 



Let’s investigate what effect each wave has on the cell average. Consider the wave denoted 
by W 2 _ 1/2 in Figure 4.7, for example. It consists of a jump in q given by 

Wf _ l/2 = af_ l/ 2 r 2 , 

propagating at speed A 2 , and hence after time At it has moved a distance A 2 At. This wave 
modifies the value of q over a fraction of the grid cell given by k 2 At / Ax. It follows that 
the effect of this wave on the cell average of q is to change the average value by the amount 



A 2 At 
A.r 



W, 



i - 1/2- 



The minus sign arises because the value >V 2 _ 1j/2 measures the jump from right to left, and 
is analogous to the minus sign in (4.28). 

Each of the waves entering the grid cell has an analogous effect on the cell average, and 
the new cell average can be found by simply adding up these independent effects. For the 
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case shown in Figure 4.7, we thus find that 

X 2 At 



Qi = Q" 



Ax 



■ W, 



i- 1/2 



A 3 A t 
Ax 



■ W, 



i — 1/2 ' 



A 1 Af 
Ax 



W}_ 



1 + 1/2 



Ar 



= Qi - — (A 2 W,A 1/2 + A 3 nf 1/2 + X}W} +1/2 ). 



(4.39) 



Note that we use the 2- and the 3-wave originating from jt;_i/ 2 and the 1-wave originat- 
ing from x,+i/ 2 , based on the presumed wave speeds. This can be written in a form that 
generalizes easily to arbitrary hyperbolic systems of m equations. Let 



A + = max(A,0), A = min(A, 0), 



(4.40) 



and suppose the solution of the Riemann problem consists of m waves W p traveling at 
speeds X p , each of which may be positive or negative. Then the cell average is updated by 



e; +! _ g- _ £ 



X>o + wf_ 1/2 + 



p 

+ 1/2 



\_p=l 



P= 1 



(4.41) 



The cell average is affected by all right-going waves from x, _ i / 2 and by all left-going waves 
from x, + i/ 2 . This is a generalization of (4.35). Understanding this formulation of Godunov’s 
method is crucial to understanding many of the other algorithms presented in this book. 
As a shorthand notation, we will also introduce the following symbols: 



a - aq ^ = X> p rw, p _ l/2 , 

p = i 

m 

_4 + AG,_i / 2 = J](AO + Wf_ 1/2 , 

p= i 



(4.42) 



so that (4.41 ) can be rewritten as 

0" +1 = Qi ~ ^{A+AQ.^2 + A~AQ i+1/2 ). (4.43) 

The symbol A + AQj-1/2 should be interpreted as a single entity that measures the net 
effect of all right-going waves from x ( _i/ 2 , while A~ A 2/- 1/2 measures the net effect of all 
left-going waves from this same interface. These net effects will also sometimes be called 
fluctuations. Note that within cell C,-, it is the right-going fluctuation from the left edge, 
A + AQi-\/2, and the left-going fluctuation from the right edge, A~ A Qi + \/ 2 , that affect the 
cell average. 

The notation introduced in (4.42) is motivated by the following observation. For the 
constant-coefficient linear system q, + Aq x — 0, we have 



Wf-1/2 = « 



i—l/2 r 



where r p is the pth eigenvector of A, and the propagation speed is the corresponding 
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eigenvalue ). p . Define the matrices 





-(^)+ 




■a 1 )- 


A + = 


a 2 )+ 


, A” = 


a 2 )- 








(X m )~ _ 



(4.44) 

Thus A + has only the positive eigenvalues on the diagonal, with negative ones replaced by 
zero, and conversely for A - . Now define 

A+ = RA + R~ l and A~ = RA~R~ l , (4.45) 



and note that 



A+ + A” = R( A+ + A ~)R~ l = RAR~ l = A. (4.46) 

This gives a useful splitting of the coefficient matrix A into pieces essential for right-going 
and left-going propagation. Now if we let A2/-1/2 = Qi — Qi- i and multiply this vector 
by A + , we obtain 



A + A0,'_i /2 = RA+R-'iQi - Qi- 1) 

= RA + a ,-\/2 

m 

= E^) + «f-l/2^ 

p=\ 

= A + AQi- l/2 . (4.47) 



Similarly, we compute that 



m 

A-AQi-y /2 = l/2 r p 

P= 1 

= A-AQi- 1/2 . (4.48) 

So in the linear constant-coefficient case, each of the fluctuations _4 + A(2 ,-_i/ 2 and 
A~ AQi-i/ 2 can be computed by simply multiplying the matrix A + or A” by the jump 
in Q. For variable-coefficient or nonlinear problems the situation is not quite so simple, and 
hence we introduce the more general notation (4.42) for the fluctuations, which can still be 
computed by solving Riemann problems and combining the appropriate waves. We will see 
that the form (4.43) of Godunov’s method can still be used. 

For the constant-coefficient linear problem, the wave-propagation form (4.43) of 
Godunov’s method can be related directly to the numerical flux function (4.37). Note 
that the value of q in the Riemann solution along x — x, _ i / 2 is 

qU/ 2 = q\Qi - !. Qi) = Qi - 1 + E 

p:kP< 0 

using the summation notation introduced in (3.19). 
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In the linear case /( Qf_y 2 ) — A 2/ - 1/2 an d so (4-37) gives 

F'Un = AQi - 1 + X! ^_ 1/2 . 

p:kP< 0 

Since W/^i /2 is an eigenvector of A with eigenvalue X p , this can be rewritten as 

m 

FUn = AQi - 1 + Y'WrWf- 1/2- ( 4 -49) 

P =i 

Alternatively, we could start with the formula 

ef-l/2 = Qi - E W "-l/2 

p:XP> 0 

and obtain 

m 

F'U /2 = AQt- E^ P ) + W,-i/2- (4.50) 

p=i 

Similarly, there are two ways to express F" + y 2 . Choosing the form 

m 

F? + 1/2 = AQi + E(^)'Wf+ 1/2 

p=i 

and combining this with (4.50) in the flux-differencing formula (4.4) gives 

Q‘ +1 = Q" ~ F i- 1 / 2 ) 

a . m m 

= Q" - ^ E(^)" W 4t/2 + E(^) +VV "-l/2 • (4-51) 

Lp= 1 p=l 

since the 4 Qj terms cancel out. This is exactly the same expression obtained in (4.41). 
For a more general conservation law q, + f(q) x = 0, we can define 

m 

FP_ 1/2 = /(G/-t) + E(^) _W '-i/ 2 = /(Gi-i) + ^ _ AGi- 1/2 (4.52) 

p=i 



*7-1/2 = /(G.O - E (XP ( +VV - P -i/2 = /(G.O - A + AQi. l/2 , (4.53) 
p = 1 

corresponding to (4.49) and (4.50) respectively, where the speeds and waves W p come 
out of the solution to the Riemann problem. 
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4.13 Flux-Difference vs. Flux- Vector Splitting 

Note that if we subtract (4.52) from (4.53) and rearrange, we obtain 

f(Qi) - f(Qi-i) = A~AQi_ 1/2 + A + AQi- U2 . (4.54) 

This indicates that the terms on the right-hand side correspond to a so-called flux -difference 
splitting. The difference between the fluxes computed based on each of the cell averages 
Qi-i and 0, is split into a left-going fluctuation that updates 0,_i and a right-going 
fluctuation that updates 0, . 

We can define a more general class of flux-difference splitting methods containing any 
method based on some splitting of the flux difference as in (4.54), followed by application 
of the formula (4.43). Such a method is guaranteed to be conservative, and corresponds to 
a flux-differencing method with numerical fluxes 

*1-1/2 = /(fit) - -4+A0/-1/2 = / (0i — i) + A-AQi-yi. (4.55) 

For a linear system, there are other ways to rewrite the numerical flux Ff_ l p that give 
additional insight. Using (4.47) in (4.49), we obtain 

Ff_ i /2 = (A+ + + A~(Q, - Q^) 

= A+Qi-i+A-Qi. (4.56) 

Since A + + A~ — A, the formula (4.56) gives a flux that is consistent with the correct 
flux in the sense of (4.14): If Q/-i = Qi = q, then (4.56) reduces to F/_ 1/2 = Aq = f(q). 
This has a very natural interpretation as a flux-vector splitting. The flux function is f(q) — 
Aq, and so A0,_i and A0, give two possible approximations to the flux at jq_ 1 / 2 . In 
Section 4.5 we considered the possibility of simply averaging these to obtain Ff_ y 2 and 
rejected this because it gives an unstable method. The formula (4.56) suggests instead a 
more sophisticated average in which we take the part of A 0, _i corresponding to right-going 
waves and combine it with the part of A 0, corresponding to left-going waves in order to 
obtain the flux in between. This is the proper generalization to systems of equations of the 
upwind flux for the scalar advection equation given in (4.33). 

This is philosophically a different approach from the flux-difference splitting discussed 
in relation to (4.54). What we have observed is that for a constant-coefficient linear system, 
the two viewpoints lead to exactly the same method. This is not typically the case for 
nonlinear problems, and in this book we concentrate primarily on methods that correspond 
to flux-difference splittings. However, note that given any flux-vector splitting, one can 
define a corresponding splitting of the flux difference in the form (4.54). If we have split 

/ ( 0 / — 1 ) = fi-i + fi ( -i and f(Qi) = f ( - ] + f (+ \ 
and wish to define the numerical flux as 

F n — f 

"i — l/2 — Ji - 1 ^ Ji 



(4.57) 
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then we can define the corresponding fluctuations as 

•A~ AQt-1,2 = - f£, 

(4.58) 

a + aq,_ 1/2 = /;. (+) - f™, 

to obtain a flux-difference splitting that satisfies (4.54) and again yields F"_y 2 via the 
formula (4.55). Flux-vector splittings for nonlinear problems are discussed in Section 15.7. 



4.14 Roe’s Method 

For a constant-coefficient linear problem there is yet another way to rewrite the flux F"_ X / 2 
appearing in (4.49), (4.50), and (4.56), which relates it directly to the unstable naive av- 
eraging of AQi - 1 and AQi given in (4.18). Averaging the expressions (4.49) and (4.50) 
gives 



77« 

l ' 2 ~ 2 



(AG/-! + AQt) - £[(^) + - (VTIW, 

p= i 



p 

i— 1/2 



(4.59) 



Notice that A. + — X — |A.|. Define the matrix |A| by 

| A | = R|A|/? _1 , where |A| = diag(|l p |). 



(4.60) 



Then (4.59) becomes 

FI i/2 = \iAQi-! + AQi) - ^\A\(Qi - Q^) 

= \uiQi- 1) + f(Qi)] - ^|A|(G,- - G/-i). (4.61) 

This can be viewed as the arithmetic average plus a correction term that stabilizes the 
method. 

For the constant-coefficient linear problem this is simply another way to rewrite the 
Godunov or upwind flux, but this form is often seen in extensions to nonlinear problems 
based on approximate Riemann solvers, as discussed in Section 15.3. This form of the flux 
is often called Roe ’s method in this connection. This formulation is also useful in studying 
the numerical dissipation of the upwind method. 

Using the flux (4.61) in the flux-differencing formula (4.4) gives the following updating 
formula for Roe’s method on a linear system: 



ar' = Q1 - ~ MQI, - au) 

1 Ar 



2 



p=\ 



(4.62) 
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This can also be derived directly from (4.41) by noting that another way to express (4.40) is 
A+ = ^(A+|k|), A' = ^a-|l|). (4.63) 

We will see in Section 12.3 that for nonlinear problems it is sometimes useful to modify 
these definitions of A.*. 



Exercises 

4.1. (a) Determine the matrices A + and A~ as defined in (4.45) for the acoustics equa- 

tions (2.50). 

(b) Determine the waves WU /2 and that result from arbitrary data <2;_i 

and Qi for this system. 

4.2. If we apply the upwind method (4.25) to the advection equation q t + uq x = 0 with 
u > 0, and choose the time step so that u At — Ax, then the method reduces to 

Q’! +l = Qt- 1- 

The initial data simply shifts one grid cell each time step and the exact solution is 
obtained, up to the accuracy of the initial data. (If the data Q 9 is the exact cell average 
of q(x), then the numerical solution will be the exact cell average for every step.) 
This is a nice property for a numerical method to have and is sometimes called the 
unit CFL condition. 

(a) Sketch figures analogous to Figure 4.5(a) for this case to illustrate the wave- 
propagation interpretation of this result. 

(b) Does the Lax-Friedrichs method (4.20) satisfy the unit CFL condition? Does 
the two-step Lax-Wendroff method of Section 4.7? 

(c) Show that the exact solution (in the same sense as above) is also obtained for 
the constant-coefficient acoustics equations (2.50) with uq = 0 if we choose 
the time step so that c At — Ax and apply Godunov’s method. Determine the 
formulas for p" +1 and u" +] that result in this case, and show how they are 
related to the solution obtained from characteristic theory. 

(d) Is it possible to obtain a similar exact result by a suitable choice of At in the 
case where uq ^ 0 in acoustics? 

4.3. Consider the following method for the advection equation with u > 0: 

e ; ,+1 = q; - (qi - qu) - ( HA A ~ Ay ) (qu - qu) 

= QU - - i) (QU - QU)- (4-64) 

(a) Show that this method results from a wave-propagation algorithm of the sort 
illustrated in Figure 4.5(a) in the case where Ax < u At < 2 Ax, so that each 
wave propagates all the way through the adjacent cell and part way through the 
next. 
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(b) Give an interpretation of this method based on linear interpolation, similar to 
what is illustrated in Figure 4.4(a). 

(c) Show that this method is exact if u At / Ax — 1 or u At / Ax — 2. 

(d) For what range of Courant numbers is the CFL condition satisfied for this 
method? (See also Exercise 8.6.) 

(e) Determine a method of this same type that works if each wave propagates 
through more than two cells but less than three, i.e., if 2 Ax < u At < 3 Ax. 

Large-time-step methods of this type can also be applied, with limited success, to 

nonlinear problems, e.g., [42], [181], [274], [275], [279], [316], 




■ 5 

Introduction to the CLAWPACK Software 



The basic class of finite volume methods developed in this book has been implemented in 
the software package CLAWPACK. This allows these algorithms to be applied to a wide va- 
riety of hyperbolic systems simply by providing the appropriate Riemann solver, along 
with initial data and boundary conditions. The high-resolution methods introduced in 
Chapter 6 are implemented, but the simple first-order Godunov method of Chapter 4 is 
obtained as a special case by setting the input parameters appropriately. (Specifically, set 
method (2) =1 as described below.) In this chapter an overview of the software is given 
along with examples of its application to simple problems of advection and acoustics. 

The software includes more advanced features that will be introduced later in the book, and 
can solve linear and nonlinear problems in one, two, and three space dimensions, as well as 
allowing the specification of capacity functions introduced in Section 2.4 (see Section 6.16) 
and source terms (see Chapter 17). CLAWPACK is used throughout the book to illustrate 
the implementation and behavior of various algorithms and their application on different 
physical systems. Nearly all the computational results presented have been obtained using 
CLAWPACK with programs that can be downloaded to reproduce these results or investigate 
the problems further. These samples also provide templates that can be adapted to solve 
other problems. See Section 1.5 for details on how to access webpages for each example. 

Only the one-dimensional software is introduced here. More extensive documentation, 
including discussion of the multidimensional software and adaptive mesh refinement capa- 
bilities, can be downloaded from the webpage 

http : //www . amath . Washington. edu/~claw/ 

See also the papers [283], [257] for more details about the multidimensional algorithms, 
and [32] for a discussion of some features of the adaptive mesh refinement code. 



5.1 Basic Framework 

In one space dimension, the CLAWPACK routine clawl (or the simplified version clawlez) 
can be used to solve a system of equations of the form 

K(x)q, + f (q ) x = yjr{q, x, t), (5.1) 
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where q — q(x, t ) e K'". The standard case of a homogeneous conservation law has k = I 
and x/z = 0, 

q, + f (q)x = o. (5.2) 

The flux function f(q) can also depend explicitly on .r and t as well as on q. Hyperbolic 
systems that are not in conservation form, e.g., 

q t + A(x,t)q x - 0, (5.3) 

can also be solved. 

The basic requirement on the homogeneous system is that it be hyperbolic in the sense 
that a Riemann solver can be specified that, for any two states g;-i and Q n returns a set of 
M w waves Wf_ l p and speeds sf_ l ^ 2 satisfying 



M w 

E w f-t/ 2 = Qi-Qi-i = aq,_ 1/2 . 

p = i 

The Riemann solver must also return a left-going fluctuation A~ A Q/-1/2 and a right-going 
fluctuation ^ + Agi-i/ 2 - In the standard conservative case (5.2) these should satisfy 

A-AQ^p + A + AQi_ l/2 = f (Qj) - f(Qi-i) (5.4) 

and the fluctuations then define & flux-difference splitting as described in Section 4.13. 
Typically 

A- AQi- V1 = £ (^_ l/2 )“Wf_ 1/2 , ^ + Ae,_ 1/2 = Y, (^-i/2) +W f-t/2- (5.5) 

p p 

where s~ = mint.v. 0) and ,v + = inaxi.v, 0). In the nonconservative case (5.3), there is no 
flux function f(q), and the constraint (5.4) need not be satisfied. 

Only the fluctuations are used for the first-order Godunov method, which is implemented 
in the form introduced in Section 4.12, 

Q- +l = Q" - ^M + Ae,_ 1/2 + A~AQ i+l/2 ), (5.6) 

assuming k = 1 . 

The Riemann solver must be supplied by the user in the form of a subroutine rpl, as 
described below. Typically the Riemann solver first computes waves and speeds and then 
uses these to compute A + A(2;-i/2 and A" A (2 ,_- 1 / 2 internally in the Riemann solver. The 
waves and speeds must also be returned by the Riemann solver in order to use the high- 
resolution methods described in Chapter 6. These methods take the form 

Q'! +1 = Qi - + A-AQ i+1/2 ) - ^-(F,+ 1/2 - Fi- 1 / 2 ), (5.7) 

A.v A.r 

where 




(5.8) 
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Here W/l , n represents a limited version of the wave , n , obtained by comparing W ; p , „ 

When a capacity function k(x ) is present, the Godunov method becomes 

0" +1 = Q1 ~ -^-(^ + Aj2/-i/2 + «4 _ AQ /+1/2 ), (5.9) 

Ki A.r v 7 

See Section 6.16 for discussion of this algorithm and its extension to the high-resolution 
method. 

If the equation has a source term, a routine srcl must also be supplied that solves the 
source-term equation q, — \jriq . k) over a time step. A fractional-step method is used to 
couple this with the homogeneous solution, as described in Chapter 17. Boundary condi- 
tions are imposed by setting values in ghost cells each time step, as described in Chapter 7. 
A few standard boundary conditions are implemented in a library routine, but this can be 
modified to impose other conditions; see Section 5.4.4. 

5.2 Obtaining CLAWPACK 

The latest version of CLAWPACK can be downloaded from the web, at 

http : //www . amath . Washington. edu/~claw/ 

Go to “download software” and select the portion you wish to obtain. At a minimum, you 
will need 

claw/ clawpack 

If you plan to use Matlab to plot results, some useful scripts are in 
claw/matlab 

Other plotting packages can also be used, but you will have to figure out how to properly 
read in the solution produced by CLAWPACK. 

The basic CLAWPACK directories Id, 2d, and 3d each contain one or two examples in 
directories such as 

claw/ ld/examplel 

that illustrate the basic use of CLAWPACK. The directory 
claw/book 

contains drivers and data for all the examples presented in this book. You can download this 
entire directory or selectively download specific examples as you need them. Some other 
applications of CLAWPACK can be found in 

claw/ applications 



5.3 Getting Started 

The discussion here assumes you are using the Unix (or Linux) operating system. The Unix 
prompt is denoted by unix>. 
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5.3.1 Creating the Directories 

The files you download will be gzipped tar files. Before installing any of CLAWPACK, you 
should create a directory named <path>/claw where the pathname <path> depends on 
where you want these files to reside and the local naming conventions on your computer. 
You should download any CLAWPACK or claw/book files to this directory. After down- 
loading any file of the form name . tar . gz , execute the following commands: 

unix> gunzip name. tar. gz 
unix> tar -xvf name. tar 

This will create the appropriate subdirectories within <path>/claw. 



5.3.2 Environment variables for the path 

You should now set the environment variable CLAW in Unix so that the proper files can be 
found: 

unix> setenv CLAW <path>/ claw 

You might want to put this line in your . cshrc file so it will automatically be executed 
when you log in or create a new window. Now you can refer to $CLAW/clawpack/ld, for 
example, and reach the correct directory. 



5.3.3 Compiling the code 

Go to the directory claw/clawpack/ld/examplel. There is a file in this directory named 
compile, which should be executable so that you can type 

unix> compile 

This should invoke f 77 to compile all the necessary files and create an executable called 
xclaw. To run the program, type 

unix> xclaw 

and the program should run, producing output files that start with fort. In particular, 
fort . qOOOO contains the initial data, and fort . qOOOl the solution at the first output time. 
The file fort . info has some information about the performance of CLAWPACK. 



5.3.4 Makefiles 

The compile file simply compiles all of the routines needed to run CLAWPACK on this 
example. This is simple, but if you make one small change in one routine, then everything 
has to be recompiled. Instead it is generally easier to use a Makef ile, which specifies what 
set of object files (ending with . o) are needed to make the executable, and which Fortran 
files (ending with . f ) are needed to make the object files. If a Fortran file is changed, then it 
is only necessary to recompile this one rather than everything. This is done simply by typing 



unix> make 




5.4 Using CLAWPACK - a Guide through example 1 
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A complication arises in that the example 1 directory only contains a few of the necessary 
Fortran files, the ones specific to this particular problem. All the standard CLAWPACK files 
are in the directory claw/clawpack/ld/lib. You should first go into that directory and 
type make to create the object files for these library routines. This only needs to be done 
once if these files are never changed. Now go to the examplel directory and also type make. 
Again an executable named xclaw should be created. See the comments at the start of the 
Makefile for some other options. 



5.3.5 Matlab Graphics 

If you wish to use Matlab to view the results, you should download the directory 
cl aw/mat lab and then set the environment variable 

unix> setenv MATLABPATH " . : \$CLAW/matlab" 

before starting Matlab, in order to add this directory to your Matlab search path. This 
directory contains the plotting routines plotclawl . m and plotclaw2 . m for plotting results 
in one and two dimensions respectively. 

With Matlab running in the examplel directory, type 

Matlab> plotclawl 

to see the results of this computation. You should see a pulse advecting to the right with veloc- 
ity 1 , and wrapping around due to the periodic boundary conditions applied in this example. 



5.4 Using CLAWPACK - a Guide through examplel 
The program in claw/clawpack/ld/examplel solves the advection equation 

q, + uq x — 0 

with constant velocity u = 1 and initial data consisting of a Gaussian hump 

q(x, 0) = exp(— /3(x — 0.3) 2 ). (5.10) 

The parameters u — 1 and f J > — 200 are specified in the file setprob . data. These values 
are read in by the routine setprob . f described in Section 5.5. 



5.4.1 The Main Program (driver . f ) 

The main program is located in the file driver . f . It simply allocates storage for the arrays 
needed in CLAWPACK and then calls clawlez, described below. Several parameters are set 
and used to declare these arrays. The proper values of these parameters depends on the 
particular problem. They are: 

maxmx : The maximum number of grid cells to be used. (The actual number mx is later 
read in from the input file clawlez . data and must satisfy mx < maxmx.) 
meqn: The number of equations in the hyperbolic system, e.g., meqn= 1 for a scalar 
equation, meqn = 2 for the acoustics equations (2.50), etc. 
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mwaves : The number of waves produced in each Riemann solution, called M w in the 
text. Often mwaves =meqn, but not always. 

mbc : The number of ghost cells used for implementing boundary conditions, as de- 
scribed in Chapter 7. Setting mbc = 2 is sufficient unless changes have been made to 
the CLAWPACK software resulting in a larger stencil. 

mwork : A work array of dimension mwork is used internally by CLAWPACK for various 
purposes. How much space is required depends on the other parameters: 

mwork > (maxmx + 2*mbc) * (2 + 4*meqn + mwaves + meqn*mwaves) 

If the value of mwork is set too small, CLAWPACK will halt with an error message telling 
how much space is required. 

maux : The number of “auxiliary” variables needed for information specifying the prob- 
lem, which is used in declaring the dimensions of the array aux (see below). 

Three arrays are declared in driver . f : 

q(l-mbc : maxmx +mbc , meqn) : This array holds the approximation Q " (a vector with 
meqn components) at each time t n . The value of i ranges from 1 to mx where mx 
<= maxmx is set at run time from the input file. The additional ghost cells numbered 
(1-mbc) :0 and (mx+1) : (mx+mbc) are used in setting boundary conditions. 

work (mwork) : Used as work space. 

aux ( 1-mbc :maxmx+mbc, maux): Used for auxiliary variables if maux > 0. For ex- 
ample, in a variable-coefficient advection problem the velocity in the i th cell might be 
stored in aux(i , 1) . See Section 5.6 for an example and more discussion. If maux = 0, 
then there are no auxiliary variables, and aux can simply be declared as a scalar or not 
declared at all, since this array will not be referenced. 



5.4.2 The Initial Conditions ( qinit . f ) 

The subroutine qinit . f sets the initial data in the array q. For a system with meqn com- 
ponents, q(i,m) should be initialized to a cell average of the mth component in the ith 
grid cell. If the data is given by a smooth function, then it may be simplest to evaluate this 
function just at the center of the cell, which agrees with the cell average to 0(( A.r) 2 ). The 
left edge of the cell is at xlower + (i— 1) *dx, and the right edge is at xlower + i*dx. It is 
only necessary to set values in cells i = 1 :mx, not in the ghost cells. The values of xlower, 
dx, and mx are passed into qinit . f , having been set in clawlez. 



5.4.3 The clawlez Routine 

The main program driver . f sets up array storage and then calls the subroutine clawlez, 
which is located in claw/clawpack/ld/lib, along with other standard CLAWPACK sub- 
routines described below. The clawlez routine provides an easy way to use CLAWPACK 
that should suffice for many applications. It reads input data from a file clawlez . data, 
which is assumed to be in a standard form described below. It also makes other assumptions 
about what the user is providing and what type of output is desired. After checking the 
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inputs for consistency, clawlez calls the CLAWPACK routine clawl repeatedly to produce 
the solution at each desired output time. 

The clawl routine (located in claw/clawpack/ld/lib/clawl . f) is much more gen- 
eral and can be called directly by the user if more flexibility is needed. See the documentation 
in the source code for this routine. 



5.4.4 Boundary Conditions 

Boundary conditions must be set in every time step, and clawl calls a subroutine bcl in 
every step to accomplish this. The manner in which this is done is described in detail in 
Chapter 7. For many problems the choice of boundary conditions provided in the default 
routine claw/clawpack/ ld/lib/bcl . f will be sufficient. For other boundary conditions 
the user must provide an appropriate routine. This can be done by copying the be 1 . f routine 
to the application directory and modifying it to insert the appropriate boundary conditions 
at the points indicated. 

When using clawlez, the clawlez. data file contains parameters specifying what 
boundary condition is to be used at each boundary (see Section 5.4.6, where the mthbc 
array is described). 



5.4.5 The Riemann Solver 

The file claw/clawpack/ld/examplel/rplad.f contains the Riemann solver, a sub- 
routine that should be named rpl if clawlez is used. (More generally the name of the 
subroutine can be passed as an argument to clawl.) The Riemann solver is the crucial 
user-supplied routine that specifies the hyperbolic equation being solved. The input data 
consists of two arrays ql and qr. The value ql (i , : ) is the value Q f at the left edge of 
the ith cell, while qr(i, : ) is the value Qf at the right edge of the ith cell, as indicated in 
Figure 5.1. Normally ql = qr and both values agree with Q " , the cell average. More flexibil- 
ity is allowed because in some applications, or in adapting CLAWPACK to implement different 
algorithms, it is useful to allow different values at each edge. For example, we might want 
to define a piecewise linear function within the grid cell as illustrated in Figure 5.1 and 




Fig. 5.1. The states used in solving the Riemann problem at the interface ,q_ i/ 2 . 
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then solve the Riemann problems between these values. This approach to high-resolution 
methods is discussed in Section 10.1. 

Note that the Riemann problem at the interface jc,- — 1/2 between cells i — 1 and i has data 



left state: Qf~\ — qr(i — 1, :), 

right state: — ql(i, :). 



(5.11) 



This notation is potentially confusing in that normally we use cp to denote the left state and 
q r to denote the right state in specifying Riemann data. The routine rpl must solve the 
Riemann problem for each value of i, and return the following: 

amdq(i , 1 :meqn) , the vector ^4“ A C?/— 1/2 containing the left-going fluctuation as 
described in Section 4.12. 

apdq(i , 1 :meqn) , the vector 4 + A£?;-i/2 containing the right-going fluctuation as 
described in Section 4.12. 

wave ( i , 1 : meqn , p) , the vector Wf_ l p representing the jump in cj across the pth wave in 
the Riemann solution at .r,_i/ 2 , for p — 1,2,..., mwaves. (In the code mw is typically 
used in place of p.) 

s (i ,p) , the wave speed l j 2 f° r each wave. 

For Godunov’s method, only the fluctuations amdq and apdq are actually used, and the 
update formula (5.6) is employed. The waves and speeds are only used for high-resolution 
correction terms as described in Chapter 6. 

For the advection equation, the Riemann solver in example 1 returns 



wave(i, 1, 1) = ql(i) — qr(i — 1), 
s(i, 1) = u, 

amdq(i, 1) = min(«, 0)*wave(i, 1, 1), 
apdq(i, 1) = max(n, 0)*wave(i, 1, 1). 

Sample Riemann solvers for a variety of other applications can be found in claw/book 
and claw/applications. Often these can be used directly rather than writing a new 
Riemann solver. 



5.4.6 The Input File clawlez.data 

The clawlez routine reads data from a file named clawlez . data. Figure 5.2 shows the file 
from examplel. Typically one value is read from each line of this file. Any text following 
this value on each line is not read and is there simply as documentation. The values read are: 

mx: The number of grid cells for this computation. (Must have mx < maxmx, where 
maxmx is set in driver . f .) 

nout : Number of output times at which the solution should be written out. 
out style: There are three possible ways to specify the output times. This parameter 
selects the desired manner to specify the times, and affects what is required next, 
outstyle = 1 : The next line contains a single value tf inal. The computation should 
proceed to this time, and the nout outputs will be at times t0+(tfinal- 
tO) /nout, where the initial time tO is set below. 
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50 


mx 


— 


cells in x direction 


11 


nout 


= 


number of output times to print results 


1 


outstyle 


= 


style of specifying output times 


2 . 2d0 


tf inal 


= 


final time 


0. ldO 


dtv(l) 


= 


initial dt (used in all steps if method(l)=0) 


1 . 0d99 


dtv(2) 


= 


max allowable dt 


l.OdO 


cf lv(l) 


= 


max allowable Courant number 


0 . 9d0 


cf lv(2) 


= 


desired Courant number 


500 


nv(l) 


= 


max number of time steps per call to clawl 


1 


method(l) 


= 


1 for variable dt 


2 


method(2) 


= 


order 


0 


method(3) 


= 


not used in one dimension 


1 


method(4) 


= 


verbosity of output 


0 


method(5) 


= 


source term splitting 


0 


method(6) 


= 


mcapa 


0 


method(7) 


= 


maux (should agree with parameter in driver) 


1 


meqn 


= 


number of equations in hyperbolic system 


1 


mwaves 


= 


number of waves in each Riemann solution 


3 


mthlim(mw) 


= 


limiter for each wave (mw=l ,mwaves) 


O.dO 


to 


= 


initial time 


O.OdO 


xlower 


= 


left edge of computational domain 


l.OdO 


xupper 


— 


right edge of computational domain 


2 


mbc 


= 


number of ghost cells at each boundary 


2 


mthbc (1) 


= 


type of boundary conditions at left 


2 


mthbc (2) 


= 


type of boundary conditions at right 



Fig. 5.2. A typical clawlez.data file, from claw/clawpack/ld/examplel for advection. 



outstyle = 2 : The next line(s) contain a list of nout times at which the outputs are 
desired. The computation will end when the last of these times is reached, 
outstyle = 3 : The next line contains two values 
nstepout, nsteps 

A total of nsteps time steps will be taken, with output after every nstepout time 
steps. The value of nout is ignored. This is most useful if you want to insure that 
time steps of maximum length are always taken with a desired Courant number. 
With the other output options, the time steps are adjusted to hit the desired times 
exactly. This option is also useful for debugging if you want to force the solution to 
be output every time step, by setting nstepout = 1. 
dt v ( 1 ) : The initial value of At used in the first time step. If method ( 1 ) = 0 below, then 
fixed-size time steps are used and this is the value of At in all steps. In this case At 
must divide the time increment between all requested outputs an integer number of 
times. 

dtv(2) : The maximum time step At to be allowed in any step (in the case where 
method! 1) =1 and variable At is used). Variable time steps are normally chosen 
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based on the Courant number, and this parameter can then simply be set to some very 
large value so that it has no effect. For some problems, however, it may be necessary to 
restrict the time step to a smaller value based on other considerations, e.g., the behavior 
of source terms in the equations. 

cf lv(l) : The maximum Courant number to be allowed. The Courant number is calcu- 
lated after all the Riemann problems have been solved by determining the maximum 
wave speed seen. If the Courant number is no larger than cflv(l), then this step is 
accepted. If the Courant number is larger, then: 
method (1)=0: (fixed time steps), the calculation aborts. 

method(l)=l: (variable time steps), the step is rejected and a smaller time step is 
taken. 

Usually cf lv(l) = 1 can be used. 

cflv(2) : The desired Courant number for this computation. Used only if method(l)=l 
(variable time steps). In each time step, the next time increment At is based on the 
maximum wave speed found in solving all Riemann problems in the previous time 
step. If the wave speeds do not change very much, then this will lead to roughly the 
desired Courant number. It’s typically best to take cf lv (2) to be slightly smaller than 
cf lv(l) , say cf lv(2) =0.9. 

nv(l) : The maximum number of time steps allowed in any single call to clawl. This 
is provided as a precaution to avoid too lengthy runs. 
method(l) : Tells whether fixed or variable size time steps are to be used, 
method (1) = 0 : A fixed time step of size dtv(l) will be used in all steps, 
method ( 1 ) = 1 : CLAWPACK will automatically select the time step as described above, 
based on the desired Courant number, 
method (2) : The order of the method. 

method (2) = 1 : The first-order Godunov’s method described in Chapter 4 is used, 
method (2) = 2 : High-resolution correction terms are also used, as described in 
Chapter 6. 

method (3) : This parameter is not used in one space dimension. In two and three di- 
mensions it is used to further specify which high-order correction terms are applied, 
method (4) : This controls the amount of output printed by clawl on the screen as 
CLAWPACK progresses. 

method (4) = 0 : Information is printed only when output files are created, 
method (4) = 1 : Every time step the value At and Courant number are reported. 
method(5) : Tells whether there is a source term in the equation. If so, then a fractional- 
step method is used as described in Chapter 17. Time steps on the homogeneous 
hyperbolic equation are alternated with time steps on the source term. The solution 
operator for the source terms must be provided by the user in the routine srcl . f . 
method (5) = 0 : There is no source term. In this case the default routine 

claw/clawpack/ld/lib/srcl . f can be used, which does nothing, and in fact 
this routine will never be called. 

method(5) = 1 : A source term is specified in srcl . f , and the first-order (Godunov) 
fractional-step method should be used. 

method(5) = 2 : A source term is specified in srcl . f , and a Strang splitting is used. 
The Godunov splitting is generally recommended rather than the Strang splitting, for 
reasons discussed in Chapter 17. 
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method (6) : Tells whether there is a “capacity function” in the equation, as introduced 
in Section 2.4. 

method(6) = 0 : No capacity function; k = 1 in (2.27). 

method(6) =mcapa>0: There is a capacity function, and the value of k in the / th 
cell is given by aux (i ,mcapa) , i.e., the mcapa component of the aux array is used 
to store this function. In this case capacity-form differencing is used, as described 
in Section 6.16. 

method (7) : Tells whether there are any auxiliary variables stored in an aux array. 
method(7) =0: No auxiliary variables. In this case the array aux is not referenced 
and can be a dummy variable. 

method(7) =maux > 0 : There is an aux array with maux components. In this case 
the array must be properly declared in driver . f . 

Note that we must always have maux > mcapa. The value of method(7) specified 
here must agree with the value of maux set in driver . f . 
meqn : The number of equations in the hyperbolic system. This is also set in driver . f 
and the two should agree. 

mwaves : The number of waves in each Riemann solution. This is often equal to meqn 
but need not be. This is also set in driver . f , and the two should agree. 
mthlimCl : mwaves) : The limiter to be applied in each wave family as described in 
Chapter 6. Several different limiters are provided in CLAWPACK [see (6.39)]; 
mthlim(mw) = 0 : No limiter (Lax-Wendroff) 
mthlim(mw) = 1 : Minmod 
mthlim(mw) = 2: Superbee 
mthlim(mw) =3: van Leer 
mthlim(mw) = 4: MC (monotonized centered) 

Other limiters can be added by modifying the routine 
claw/clawpack/ld/lib/philim. f , which is called by 
claw/ clawpack/ld/lib/limiter . f . 
tO : The initial time. 

xlower : The left edge of the computational domain, 
xupper : The right edge of the computational domain. 

mbc : The number of ghost cells used for setting boundary conditions. Usually mbc = 2 is 
used. See Chapter 7. 

mthbc(l) : The type of boundary condition to be imposed at the left boundary. See 
Chapter 7 for more description of these and how they are implemented. The following 
values are recognized: 

mthbc ( 1 ) = 0 ; The user will specify a boundary condition. In this case you must copy 
the file claw/ clawpack/ld/lib/bcl . f to your application directory and modify 
it to insert the proper boundary conditions in the location indicated, 
mthbc ( 1 ) = 1 : Zero-order extrapolation. 

mthbc (1) = 2: Periodic boundary conditions. In this case you must also set 
mthbc (2) = 2. 

mthbc ( 1 ) = 3 : Solid wall boundary conditions. This set of boundary conditions only 
makes sense for certain systems of equations; see Section 7.3.3. 
mthbc (2) : The type of boundary condition to be imposed at the right boundary. The 
same values are recognized as described above. 
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5.5 Other User-Supplied Routines and Files 

Several other routines may be provided by the user but are not required. In each case there 
is a default version provided in the library claw/clawpack/ld/lib that does nothing 
but return. If you wish to provide a version, copy the library version to the application 
directory, modify it as required, and also modify the Makefile to point to the modified 
version rather than to the library version. 

setprob . f The clawlez routine always calls setprob at the beginning of execution. 
The user can provide a subroutine that sets any problem-specific parameters or does 
other initialization. For the advection problem solved in examplel, this is used to 
set the advection velocity u. This value is stored in a common block so that it can be 
passed into the Riemann solver, where it is required. The parameter beta is also set and 
passed into the routine qinit . f for use in setting the initial data according to (5.10). 
When clawlez is used, a setprob subroutine must always be provided. If there is 
nothing to be done, the default subroutine claw/clawpack/ld/lib/setprob . f can 
be used, which does nothing but return. 

setaux . f The clawlez routine calls a subroutine setaux before the first call to clawl. 
This routine should set the array aux to contain any necessary data used in specifying 
the problem. For the example in examplel no aux array is used (maux = 0 in driver . f ) 
and the default subroutine claw/clawpack/ld/lib/setaux . f is specified in the 
Makefile. 

b4stepl . f Within clawl there is a call to a routine b4stepl before each call to stepl 
(the CLAWPACK routine that actually takes a single time step). The user can supply a 
routine b4stepl in place of the default routine claw/clawpack/ld/lib/b4stepl . f 
in order to perform additional tasks that might be required each time step. One example 
might be to modify the aux array values each time step, as described in Section 5.6. 

srcl .f If the equation includes a source term if as in (5.1), then a routine srcl must 
be provided in place of the default routine claw/clawpack/ld/lib/srcl .f. This 
routine must solve the equation q t — ir over one time step. Often this requires solving 
an ordinary differential equation in each grid cell. In some cases a partial differential 
equation must be solved, for example if diffusive terms are included with x[r — q xx , 
then the diffusion equation must be solved over one time step. 



5.6 Auxiliary Arrays and setaux . f 

The array q(i,l:meqn) contains the finite-volume solution in the ith grid cell. Often 
other arrays defined over the grid are required to specify the problem in the first place. For 
example, in a variable-coefficient advection problem 

q, + u(x)q x = 0 

the Riemann solution at any cell interface x,_i /2 depends on the velocities «,_i and . 
The aux array can be used to store these values and pass them into the Riemann solver. 
In the advection example we need only one auxiliary variable, so maux = 1 and we store 
the velocity u, in aux(i,l). See Chapter 9 for more discussion of variable-coefficient 
problems. 




Exercises 
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Of course one could hard-wire the specific function u(x) into the Riemann solver or pass 
it in using a common block, but the use of the auxiliary arrays gives a uniform treatment of 
such data arrays. This is useful in particular when adaptive mesh refinement is applied, in 
which case there are many different q grids covering different portions of the computational 
domain and it is very convenient to have an associated aux array corresponding to each. 

The clawlez routine always calls a subroutine setaux before beginning the compu- 
tation. This routine, normally stored in setaux. f, should set the values of all auxiliary 
arrays. If maux = 0 then the default routine claw/clawpack/ld/lib/setaux . f can be 
used, which does nothing. See Section 5.6 for an example of the use of auxiliary arrays. 

In some problems the values stored in the aux arrays must be time-dependent, for 
example in an advection equation of the form q, + uix , t)q x = 0. The routine setaux 
is called only once at the beginning of the computation and cannot be used to mod- 
ify values later. The user can supply a routine b4stepl in place of the default routine 
claw/clawpack/ld/lib/b4stepl .f in order to modify the aux array values each time 
step. 



5.7 An Acoustics Example 

The directory claw/clawpack/ld/example2 contains a sample code for the constant- 
coefficient acoustics equations (2.50). The values of the density and bulk modulus are set 
in setprob.f (where they are read in from a data file setprob . data). In this routine 
the sound speed and impedance are also computed and passed to the Riemann solver in a 
common block. The Riemann solver uses the formulas (3.31) to obtain a 1 and a 1 , and then 
the waves are W p = a p r p . The boundary conditions are set for a reflecting wall at x — — 1 
and nonreflecting outflow at x = 1 . 



Exercises 

The best way to do these exercises, or more generally to use CLAWPACK on a new problem, 
is to copy an existing directory to a new directory with a unique name and modify the 
routines in that new directory. 

5.1. The example in claw/clawpack/ld/example2 has method(2)=2 set in 
clawlez . data, and hence uses a high-resolution method. Set method (2) =1 to 
use the upwind method instead, and compare the computed results. 

5.2. Modify the data from Exercise 5.1 to take time steps for which the Courant number 
is 1.1. (You must change both cf lv(l) and cf lv(2) in clawlez . data.) Observe 
that the upwind method is unstable in this case. 

5.3. The initial data in claw/clawpack/ld/example2/qinit . f has q 2 (x, 0) = 0 and 
hence the initial pressure pulse splits into left going and right going pulses. Modify 
q 2 (x . 0) so that the initial pulse is purely left going. 




High-Resolution Methods 



In Chapter 4 we developed the basic ideas of Godunov’s method, an upwind finite vol- 
ume method for hyperbolic systems, in the context of constant-coefficient linear systems. 
Godunov’s method is only first-order accurate and introduces a great deal of numerical 
diffusion, yielding poor accuracy and smeared results, as can be seen in Figure 6.1(a), for 
example. In this chapter we will see how this method can be greatly improved by introducing 
correction terms into (4.43), to obtain a method of the form 

Ql +l = Qi - ^i A+ &Qi-i/2 + A ~AQ i+l/2 ) - ^-(F i+1/2 - Fi-w). (6.1) 

The fluxes F i- \/2 are based on the waves resulting from the Riemann solution, which have 
already been computed in the process of determining the fluctuations _4 ± A(9,_ 1( /2. The 
basic form of these correction terms is motivated by the Lax-Wendroff method, a standard 
second-order accurate method described in the next section. The addition of crucial limiters 
leads to great improvement, as discussed later in this chapter. 



6.1 The Lax-Wendroff Method 

The Lax-Wendroff method for the linear system q, + Aq x — 0 is based on the Taylor series 
expansion 



1 2 

q(x, t n+ 1 ) = q(x, t„) + At q t (x, t n ) + -(At) q„(x, t„) H . (6.2) 

From the differential equation we have that q, — —Aq x , and differentiating this gives 

q i ! — Aq xt — A~q xx , 

where we have used q xt — q tx — (—Aq x ) x . Using these expressions for q, and q tt in (6.2) 
gives 



qtx, t n+ 1 ) = q{x, t n ) - At Aq x tx, t n ) + -(A t) 2 A 2 q xx (x, t n ) H . 



(6.3) 
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Upwind at t=1 Upwind at t = 5 





Fig. 6.1. Tests on the advection equation with different linear methods. Results at time t = 1 and t = 5 
are shown, corresponding to 1 and 5 revolutions through the domain in which the equation q, + q x = 0 
is solved with periodic boundary conditions: (a) upwind, (b) Lax-Wendroff, (c) Beam-Warming. 
[claw/book/chap6/ compareadv] 



Keeping only the first three terms on the right-hand side and replacing the spatial derivatives 
by central finite difference approximations gives the Lax-Wendroff method, 



Q'l + = Q'l - 



At 
2 Ax 



A (Ql 



+i 



QU) 




2 Q1 + Q'! + 1 ). 



(6.4) 



By matching three terms in the Taylor series and using centered approximations, we obtain 
a second-order accurate method. 

This derivation of the method is based on a finite difference interpretation, with Q" 
approximating the pointwise value q{x,, t n ). However, we can reinterpret (6.4) as a finite 
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volume method of the form (4.4) with the flux function 

F-- 1/2 = ^(0'U + Qi ) - \^- x A \Q n i - Qi- 1)- (6.5) 

Note that this looks like the unstable averaged flux (4.18) plus a diffusive flux, but that the 
diffusion chosen exactly matches what appears in the Taylor series expansion (6.3). Indeed, 
this shows why the averaged flux (4.18) alone is unstable - the Taylor series expansion for 
the true solution contains a diffusive q xx term that is missing from the numerical method 
when the unstable flux is used. 

To compare the typical behavior of the upwind and Lax-Wendroff methods. Figure 6. 1 
shows numerical solutions to the scalar advection equation q, + q x = 0, which is solved on 
the unit interval up to time t — 1 with periodic boundary conditions. Hence the solution 
should agree with the initial data, translated back to the initial location. The data, shown as a 
solid line in each plot, consists of both a smooth pulse and a square-wave pulse. Figure 6.1(a) 
shows the results when the upwind method is used. Excessive dissipation of the solution 
is evident. Figure 6.1(b) shows the results when the Lax-Wendroff method is used instead. 
The smooth pulse is captured much better, but the square wave gives rise to an oscillatory 
solution. This can be explained in terms of the Taylor series expansion (6.2) as follows. 
By matching the first three terms in the series expansion, the dominant error is given by 
the next term, q ttt — ~ A ] q xxx . This is a dispersive term, which leads to oscillations, as 
explained in more detail in Section 8.6 where modified equations are discussed. In this 
chapter we will see a different explanation of these oscillations, along with a cure based on 
limiters. 

In each of these figures the results were computed using a Courant number At / Ax = 0.8. 
Choosing different values gives somewhat different results, though the same basic behavior. 
Each method works best when the Courant number is close to 1 (and in fact is exact if the 
Courant number is exactly 1 for this simple problem) and less well for smaller values of 
At/ Ax . The reader is encouraged to experiment with the CLAWPACK codes in the directories 
referenced in the figures. 



6.2 The Beam- Warming Method 

The Lax-Wendroff method (6.4) is a centered three-point method. If we have a system for 
which all the eigenvalues of A are positive (e.g., the scalar advection equation with u > 0), 
then we might think it is preferable to use a one-sided formula. In place of the centered 
formula for q x and q xx , we might use 

1 2 

q.x(xi, t n ) = — — [3q(x h t „ ) - 4q(Xi-i,t n ) + q{x f - 2 , L )] + 0( Ax ), 

2 Ax 

l ( 6 . 6 ) 

qxxixi, t„) = ——r[q(Xi, t„) - 2q(xj-u t„) + q(Xi - 2 , t n )] + 0( Ax). 

(A xY 



Using these in (6.3) gives a method that is again second-order accurate. 
At 



Q" +1 = QI 



2 Ax 



A(3Q" 



4 QU + 07 - 2 ) + 2 a2 ( q ? - 2Q i-' + 



(6.7) 




6.3 Preview of Limiters 



103 



This is known as the Beam-Warming method, and was originally introduced in [481]. It can 
be written as a flux-differencing finite volume method with 

F’Un = AQU + l -_A (i - (QU - QU). (6.8) 

Figure 6.1(c) shows the results of the previous advection test using the Beam-Warming 
method. The behavior is similar to that of the Lax-Wendroff method in that oscillations 
appear, though the oscillations are now ahead of the discontinuities rather than behind. 



6.3 Preview of Limiters 

Second-order accurate methods such as the Lax-Wendroff or Beam-Warming give much 
better accuracy on smooth solutions than the upwind method, as seen in Figure 6.1, but 
fail near discontinuities, where oscillations are generated. In fact, even when the solution is 
smooth, oscillations may appear due to the dispersive nature of these methods, as evident 
in Figure 6. 1 . Upwind methods have the advantage of keeping the solution monotonically 
varying in regions where the solution should be monotone, even though the accuracy is not 
very good. The idea with high-resolution methods is to combine the best features of both 
methods. Second-order accuracy is obtained where possible, but we don’t insist on it in 
regions where the solution is not behaving smoothly (and the Taylor series expansion is not 
even valid). With this approach we can achieve results like those shown in Figure 6.2. 

The dispersive nature of the Lax-Wendroff method also causes a slight shift in the 
location of the smooth hump, a phase error, that is visible in Figure 6.1, particularly 
at the later time t — 5. Another advantage of using limiters is that this phase error can 
be essentially eliminated. Figure 6.3 shows a computational example where the initial 
data consists of a wave packet, a high-frequency signal modulated by a Gaussian. With a 
dispersive method such a packet will typically propagate at an incorrect speed corresponding 
to the numerical group velocity of the method. The Lax-Wendroff method is clearly quite 
dispersive. The high-resolution method shown in Figure 6.3(c) performs much better. There 
is some dissipation of the wave, but much less than with the upwind method. The main goal 
of this chapter is to develop the class of high-resolution methods used to obtain these results. 

A hint of how this can be done is seen by rewriting the Lax-Wendroff flux (6.5) as 

FI t/2 = {A-Q- + A + QU ) + l -\A\ (l - |^|A|) (Q'; - QU), (6.9) 

using the notation A” , A + , | A | defined in Section 4.12. This now has the form of the upwind 
flux (4.56) with a correction term. Using this in the flux-differencing method (4.4) gives a 
method of the form (6.1). Note that the correction term in (6.9) looks like a diffusive flux, 
since it depends on Q" — <2”_i and has the form of (4.10), but the coefficient is positive 
if the CFL condition is satisfied. Hence it corresponds to an anticlijfusive flux that has the 
effect of sharpening up the overly diffusive upwind approximation. 

The idea is now to modify the final term in (6.9) by applying some form of limiter that 
changes the magnitude of the correction actually used, depending on how the solution is 
behaving. The limiting process is complicated by the fact that the solution to a hyperbolic 
system typically consists of a superposition of waves in several different families. At a given 
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Minmod at t = 1 Minmod at t = 5 





Fig. 6.2. Tests on the advection equation with different high-resolution methods, as in Figure 6.1: 
(a) minmod limiter, (b) superbee limiter, (c) MC limiter. [claw/book/chap6/compareadv] 



point and time, some of the waves passing by may be smooth while others are discontinuous. 
Ideally we would like to apply the limiters in such a way that the discontinuous portion 
of the solution remains nonoscillatory while the smooth portion remains accurate. To do 
so we must use the characteristic structure of the solution. We will see that this is easily 
accomplished once we have solved the Riemann problem necessary to implement the upwind 
Godunov method. The second-order correction terms can be computed based on the waves 
arising in that Riemann solution, with each wave limited independently from the others. 
This process is fully described later in this chapter. 

More generally, one can consider combining any low-order flux formula TiXQi- 1, Qi) 
(such as the upwind flux) and any higher-order formula TniQi- j, Qi) (such as the 
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Upwind at t = 1 Upwind at t = 10 





Fig. 6.3. Tests on the advection equation with different methods on a wave packet. Results at time t = 1 
and t = 10 are shown, corresponding to 1 and 10 revolutions through the domain in which the equation 
q, + q x = 0 is solved with periodic boundary conditions, [claw/book/ chap6/wavepacket] 

Lax-Wendroff ) to obtain a flux-limiter method with 

F". 1/2 = TAQi- 1, Qi)+^U/2[^H(Qi-u Qi) - TtXQi-x, Q>)]. (6.10) 

If < t > "_i /2 = 0, then this reduces to the low-order method, while if = 1, we obtain the 

high-order method. This idea of applying limiters to improve the behavior of high-order 
methods appeared in the early 1970s in the hybrid method of Harten and Zwas [190] and 
the flux-corrected transport (FCT) method of Boris and Book [38] (see also [348], [496]). 
A wide variety of other methods of this form have since been developed, along with better 










106 



6 High-Resolution Methods 



theoretical techniques to analyze them. In this chapter we combine many of these ideas to 
develop a class of methods that is relatively easy to extend to harder problems. 

In the next section we start by giving a geometric interpretation for the scalar advection 
equation, leading to slope-limiter methods of the type pioneered in van Leer’s work [464]- 
[468]. For the scalar advection equation there are many ways to interpret the same method, 
and it is illuminating to explore these. In particular we will see how this relates to flux- 
limiter methods of the type studied by Sweby [429], who used the algebraic total variation 
diminishing (TVD) conditions of Harten [179] to derive conditions that limiter functions 
should satisfy for more general nonlinear conservation laws. We will, however, ultimately 
use a different approach to apply these limiters to nonlinear problems, closer to the geometric 
approach in Goodman & LeVeque [160]. This can be interpreted as applying the limiter 
functions to the waves resulting from the Riemann solution. Extending this to linear systems 
of equations gives the algorithm introduced in Section 6.13. The method is then easily 
generalized to nonlinear systems, as described briefly in Section 6.15 and more fully in 
Chapter 15. Multidimensional versions are discussed in Chapters 19 through 23. 



6.4 The REA Algorithm with Piecewise Linear Reconstruction 

Recall the reconstruct-evolve-average (REA) Algorithm 4. 1 introduced in Section 4. 10. For 
the scalar advection equation we derived the upwind method by reconstructing a piecewise 
constant function q" (x , t„) from the cell averages Q'] , solving the advection equation with 
this data, and averaging the result at time f„ +1 over each grid cell to obtain Q " +l . To achieve 
better than first-order accuracy, we must use a better reconstruction than a piecewise constant 
function. From the cell averages Q" we can construct a piecewise linear function of the 
form 



q'\x, t n ) = Q1 + <y"(x - Xj) for x,_, /2 < x < x i+l/2 , (6.1 1) 

where 

1, . 1 

Xi = -(*/— 1/2 + Xi+l/2) = Xi- 1/2 + -Ax (6.12) 

is the center of the i th grid cell and er " is the slope on the i th cell. The linear function (6. 1 1 ) on 
the i th cell is defined in such a way that its value at the cell center x, is Q" . More importantly, 
the average value of q"(x , t„) over cell C ,• is Q" (regardless of the slope o' 1 ), so that the 
reconstructed function has the cell average Q" . This is crucial in developing conservative 
methods for conservation laws. Note that steps 2 and 3 are conservative in general, and so 
Algorithm 4.1 is conservative provided we use a conservative reconstruction in step 1, as 
we have in (6. 11). Later we will see how to write such methods in the standard conservation 
form (4.4). 

For the scalar advection equation q, + uq x — 0, we can easily solve the equation with this 
data, and compute the new cell averages as required in step 3 of Algorithm 4.1. We have 



q n {x, t n+ 1 ) = q"(x — u At, t„). 
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Until further notice we will assume that u > 0 and present the formulas for this particular 
case. The corresponding formulas for u < 0 should be easy to derive, and in Section 6.10 
we will see a better way to formulate the methods in the general case. 

Suppose also that \u Af/Ax| < 1, as is required by the CFL condition. Then it is 
straightforward to compute that 



„ | 1 ll L-ll I „ L „ \ I W Z_X t 

0 " + = a 7 ^ 1 + 2 (A ^- SAf) < 1 j + ( 1 - a7 

u At , „ „ , 1 u At 

= Q ‘~ A7 (e; “ e '-‘) “ 2 a7 <A ' “ s A ')( c 



Again note that this is the upwind method with a correction term that depends on the slopes. 



6.5 Choice of Slopes 

Choosing slopes er" = 0 gives Godunov’s method (the upwind method for the advection 
equation), since the final term in (6. 13) drops out. To obtain a second-order accurate method 
we want to choose nonzero slopes in such a way that a " approximates the derivative q x 
over the ith grid cell. Three obvious possibilities are 



Centered slope: 


y , 07 + i - 07-i 

' 2 A.r 


(Fromm), 


(6.14) 


Upwind slope: 


07 - 07-i 

' Ax 


(B earn- Warming ) , 


(6.15) 


Downwind slope: 


07+i - 07 

' Ax 


(Lax-Wendroff) . 


(6.16) 



The centered slope might seem like the most natural choice to obtain second-order accuracy, 
but in fact all three choices give the same formal order of accuracy, and it is the other two 
choices that give methods we have already derived using the Taylor series expansion. Only 
the downwind slope results in a centered three-point method, and this choice gives the 
Lax-Wendroff method. The upwind slope gives a fully-upwinded 3-point method, which 
is simply the Beam-Warming method. 

The centered slope (6. 14) may seem the most symmetric choice at first glance, but because 
the reconstructed function is then advected in the positive direction, the final updating 
formula turns out to be a nonsymmetric four-point formula, 

07 +1 = 07 - ^ x (Qi + 1 + 3 Ql- 5 QU + 07 - 2 ) 

- jx^(Qi + 1 - QI ~ QI- 1 + QU- (6.17) 

This method is known as Fromm ’s method. 
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6.6 Oscillations 

As we have seen in Figure 6.1, second-order methods such as the Lax-Wendroff or Beam- 
Warming (and also Fromm’s method) give oscillatory approximations to discontinuous 
solutions. This can be easily understood using the interpretation of Algorithm 4.1. 

Consider the Lax-Wendroff method, for example, applied to piecewise constant data 
with values 



Ql = 



1 if i < J, 

0 if ; > J. 



Choosing slopes in each grid cell based on the Lax-Wendroff prescription (6.16) gives the 
piecewise linear function shown in Figure 6.4(a). The slope a' 1 is nonzero only for i = J . 

The function q"(x, t„) has an overshoot with a maximum value of 1.5 regardless of Ax. 
When we advect this profile a distance u At, and then compute the average over the /th cell, 
we will get a value that is greater than 1 for any At with 0 < u At < Ax. The worst case is 
whenS At = Ax/2, in which caseq n (x, t n+ i) is shown in Figure 6.4(b) and Q"j +1 — 1.125. 
In the next time step this overshoot will be accentuated, while in cell / — 1 we will now 
have a positive slope, leading to a value Q n jt\ that is less than 1 . This oscillation then grows 
with time. 

The slopes proposed in the previous section were based on the assumption that the 
solution is smooth. Near a discontinuity there is no reason to believe that introducing this 
slope will improve the accuracy. On the contrary, if one of our goals is to avoid nonphysical 
oscillations, then in the above example we must set the slope to zero in the /th cell. Any 
cjj < 0 will lead to Q') +l > 1, while a positive slope wouldn’t make much sense. On the 
other hand we don’t want to set all slopes to zero all the time, or we simply have the 
first-order upwind method. Where the solution is smooth we want second-order accuracy. 
Moreover, we will see below that even near a discontinuity, once the solution is somewhat 
smeared out over more than one cell, introducing nonzero slopes can help keep the solution 
from smearing out too far, and hence will significantly increase the resolution and keep 
discontinuities fairly sharp, as long as care is taken to avoid oscillations. 

This suggests that we must pay attention to how the solution is behaving near the i th 
cell in choosing our formula for a". (And hence the resulting updating formula will be 
nonlinear even for the linear advection equation). Where the solution is smooth, we want to 
choose something like the Lax-Wendroff slope. Near a discontinuity we may want to limit 
this slope, using a value that is smaller in magnitude in order to avoid oscillations. Methods 
based on this idea are known as slope-limiter methods. This approach was introduced by van 





Fig. 6.4. (a) Grid values Q" and reconstructed q"(-,t n ) using Lax-Wendroff slopes, (b) After advection 
with u At — Ax/2. The dots show the new cell averages Q" +l . Note the overshoot. 
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Leer in a series of papers, [464] through [468], where he developed the approach known 
as MUSCL (monotonic upstream-centered scheme for conservation laws) for nonlinear 
conservation laws. 



6.7 Total Variation 

How much should we limit the slope? Ideally we would like to have a mathematical prescrip- 
tion that will allow us to use the Lax-Wendroff slope whenever possible, for second-order 
accuracy, while guaranteeing that no nonphysical oscillations will arise. To achieve this we 
need a way to measure oscillations in the solution. This is provided by the notion of the 
total variation of a function. For a grid function Q we define 

OO 

TV(Q)= Y, \Q> Ci-tl- (6.18) 

i =—oo 

For an arbitrary function ^(i) we can define 

N 

T V(q) = sup2>(&) - qGj-i)\, (6.19) 

j = i 

where the supremum is taken over all subdivisions of the real line — oo = §o < §i < • • • < 
— oo. Note that for the total variation to be finite, Q or q must approach constant values 
q ± as x — >• ±oo. 

Another possible definition for functions is 

1 r°° 

TV(q ) = limsup - / | q(x) — q(x — e)| dx. (6.20) 

€— >0 t J — OO 

If q is differentiable, then this reduces to 

/ OO 

\q'{x)\dx. (6.21) 

-OO 

We can use (6.21) also for nondifferentiable functions (distributions) if we interpret q'(x) 
as the distribution derivative (which includes delta functions at points where q is discon- 
tinuous). Note that if we define a function q(x) from a grid function Q using a piecewise 
constant approximation, then TVb/) = TV( Q). 

The true solution to the advection equation simply propagates at speed u with unchanged 
shape, so that the total variation TV(g(-, t)) must be constant in time. A numerical solution 
to the advection equation might not have constant total variation, however. If the method 
introduces oscillations, then we would expect the total variation of Q" to increase with 
time. We can thus attempt to avoid oscillations by requiring that the method not increase 
the total variation: 

Definition 6.1. A two-level method is called total variation diminishing (TVD) if for any 
set of data Q n , the values Q n+l computed by the method satisfy 



TV(<2" +1 ) < TV(G"). 



(6.22) 
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Note that the total variation need not actually diminish in the sense of decreasing; it 
may remain constant in time. A better term might be total variation nonincreasing. In 
fact this term (and the abbreviation TVNI) was used in the original work of Harten [179], 
who introduced the use of this tool in developing and analyzing numerical methods for 
conservation laws. It was later changed to TVD as a less cumbersome term. 

If a method is TVD, then in particular data that is initially monotone, say 

Q1>QU i for all i, 

will remain monotone in all future time steps. Hence if we discretize a single propagating 
discontinuity (as in Figure 6.4), the discontinuity may become smeared in future time steps 
but cannot become oscillatory. This property is especially useful, and we make the following 
definition. 



Definition 6.2. A method is called monotonicity-preserving if 

Q- > e:v, for am 

implies that 

Q'! + ' > Q-tl for alii. 



Any TVD method is monotonicity-preserving; see Exercise 6.3. 



6.8 TVD Methods Based on the REA Algorithm 

How can we derive a method that is TVD? One easy way follows from the reconstruct- 
evolve-average approach to deriving methods described by Algorithm 4. 1 . Suppose that 
we perform the reconstruction in such a way that 

TVOf(-,f„))<TV(0"). (6.23) 

Then the method will be TVD. The reason is that the evolving and averaging steps cannot 
possibly increase the total variation, and so it is only the reconstruction that we need to 
worry about. 

In the evolve step we clearly have 

TV(«f(., t n+1 )) = TV(<f(-, t n )) (6.24) 

for the advection equation, since q” simply advects without changing shape. The total 
variation turns out to be a very useful concept in studying nonlinear problems as well; 
for we will see later that a wide class of nonlinear scalar conservation laws also have this 
property, that the true solution has a nonincreasing total variation. 

It is a simple exercise (Exercise 6.4) to show that the averaging step gives 

TV((2" +1 ) < TVOfO, t n+ 1)). (6.25) 

Combining (6.23), (6.24), and (6.25) then shows that the method is TVD. 
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6.9 Slope-Limiter Methods 

We now return to the derivation of numerical methods based on piecewise linear reconstruc- 
tion, and consider how to limit the slopes so that (6.23) is satisfied. Note that setting er" = 0 
works, since the piecewise constant function has the same TV as the discrete data. Hence 
the first-order upwind method is TVD for the advection equation. The upwind method may 
smear solutions but cannot introduce oscillations. 

One choice of slope that gives second-order accuracy for smooth solutions while still 
satisfying the TVD property is the minmod slope 



er" = minmod 



Q" ~ 6 " 

Ax 



Q'U i ~ Q’i 

Ax 



where the minmod function of two arguments is defined by 



minmod(fl , h) 



a if |a| < \b\ and ab > 0, 
b if |£>| < |a| and ab > 0, 
0 if ab < 0. 



(6.26) 



(6.27) 



If a and b have the same sign, then this selects the one that is smaller in modulus, else it 
returns zero. 

Rather than defining the slope on the / th cell by always using the downwind difference 
(which would give the Lax-Wendroff method), or by always using the upwind difference 
(which would give the Beam-Warming method), the minmod method compares the two 
slopes and chooses the one that is smaller in magnitude. If the two slopes have different 
sign, then the value Q" must be a local maximum or minimum, and it is easy to check in 
this case that we must set er" = 0 in order to satisfy (6.23). 

Figure 6.2(a) shows results using the minmod method for the advection problem con- 
sidered previously. We see that the minmod method does a fairly good job of maintaining 
good accuracy in the smooth hump and also sharp discontinuities in the square wave, with 
no oscillations. 

Sharper resolution of discontinuities can be achieved with other limiters that do not 
reduce the slope as severely as minmod near a discontinuity. Figure 6.5(a) shows some 
sample data representing a discontinuity smeared over two cells, along with the minmod 
slopes. Figure 6.5(b) shows that we can increase the slopes in these two cells to twice the 
value of the minmod slopes and still have (6.23) satisfied. This sharper reconstruction will 
lead to sharper resolution of the discontinuity in the next time step than we would obtain 
with the minmod slopes. 




Fig. 6.5. Grid values Q" and reconstructed q '\- , t „ ) using (a) minmod slopes, (b) superbee or MC 
slopes. Note that these steeper slopes can be used and still have the TVD property. 
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One choice of limiter that gives the reconstruction of Figure 6.5(b), while still giving 
second order accuracy for smooth solutions, is the so-called superbee limiter introduced by 
Roe [378]: 



where 



a — maxmoi 



d(a/%/ 2) ) 



a \ 1 * = minmod 



= minmod I 2 



e? +1 -e?\ JQ1-Q n i - 1 



Ax J \ Ax 

Q'!+i - Q" \ (Ql-QU 



Ax 



Ax 



(6.28) 



Each one-sided slope is compared with twice the opposite one-sided slope. Then the maxmod 
function in (6.28) selects the argument with larger modulus. In regions where the solution 
is smooth this will tend to return the larger of the two one-sided slopes, but will still be 
giving an approximation to q x , and hence we expect second-order accuracy. We will see 
later that the superbee limiter is also TVD in general. 

Figure 6.2(b) shows the same test problem as before but with the superbee method. The 
discontinuity stays considerably sharper. On the other hand, we see a tendency of the smooth 
hump to become steeper and squared off. This is sometimes a problem with superbee - by 
choosing the larger of the neighboring slopes it tends to steepen smooth transitions near 
inflection points. 

Another popular choice is the monotonized central-difference limiter (MC limiter), which 
was proposed by van Leer [467] : 



cr" = minmod 



Q'U t ~ QU 

2Ax 



Q'l - QU 

Ax 



Q’U i ~ Q" 

Ax 



(6.29) 



This compares the central difference of Fromm’s method with twice the one-sided slope to 
either side. In smooth regions this reduces to the centered slope of Fromm’s method and 
hence does not tend to artificially steepen smooth slopes to the extent that superbee does. 
Numerical results with this limiter are shown in Figure 6.2(c). The MC limiter appears to 
be a good default choice for a wide class of problems. 



6.10 Flux Formulation with Piecewise Linear Reconstruction 

The slope-limiter methods described above can be written as flux-differencing methods 
of the form (4.4). The updating formulas derived above can be manipulated algebraically 
to determine what the numerical flux function must be. Alternatively, we can derive the 
numerical flux by computing the exact flux through the interface x,-_ 1/2 using the piecewise 
linear solution q"(x, t), by integrating M<y"(x,_i / 2 , t ) in time from t n tot„ + i.Fortheadvection 
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equation this is easy to do and we find that 

1 r"’ +l 



F n — 

r i- 1/2 — 



\ /*'«+ 1 

= — J' uq n (x i _ 1 / 2 ,t)dt 

/ <n+l 

uq n {x,- 1/2 — u(t — t„), t„) dt 

C tn + 1 

J u[Q1- 1 + (-* 1 - 1/2 - u(t - t„) - Xi-l)o?_^\dt 



1 l' r " +1 

A t 



1 f'" +l 

At 



— uQ"_ l + —u(Ax — u Af Jrr ( "_| . 

Using this in the flux-differencing formula (4.4) gives 
■ i u At , 1 u At 

Q! = Q ‘ ~ a7 (0 ' “ G '-') “ 2a7 (Ai “ 5 



i i - 



i). 



which agrees with (6.13). 

If we also consider the case u < 0, then we will find that in general the numerical flux 
for a slope-limiter method is 



f m < 2 | + ^t/(A.r — m Ar)cr'L[ if u > 0, 

Fi-\/2 ~ i . (6.30) 

l «2" — \u{ Ax + m At)a" if u < 0, 

where a" is the slope in the ith cell Cj, chosen by one of the formulas discussed previously. 

Rather than associating a slope er" with the ith cell, the idea of writing the method in 
terms of fluxes between cells suggests that we should instead associate our approximation 
to q x with the cell interface at .r,_i /2 where is defined. Across the interface x;_i /2 

we have a jump 



a 2"-i/2 = Q"i ~ Q1-u (6-31) 

and this jump divided by A.r gives an approximation to q , . This suggests that we write the 
flux (6.30) as 



F ?_ l/2 = u-Q n i+ u + QU + \\u\[\ 



u At 



Ax 



1 / 2 > 



(6.32) 



where 



= a limited version of A (2"_ 1//2 . (6.33) 

If 8'U, 2 i s Ih e j um P AQ ”_|/2 itself, then (6.32) gives the Lax-Wendroff method (see 
Exercise 6.6). From the form (6.32), we see that the Lax-Wendroff flux can be interpreted 
as a modification to the upwind flux (4.33). By limiting this modification we obtain a 
different form of the high-resolution methods, as explored in the next section. 
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6.11 Flux Limiters 

From the above discussion it is natural to view the Lax-Wendroff method as the basic 
second-order method based on piecewise linear reconstruction, since defining the jump 
<5"_ , y 2 in (6.33) in the most obvious way as A at the interface .t, — 1/2 results in that 
method. Other second-order methods have fluxes of the form (6.32) with different choices 
of 8 ; ! _ 1/2 - The slope-limiter methods can then be reinterpreted as flux-limiter methods by 
choosing 8'l_ X / 2 to be a limited version of (6.31). In general we will set 

Ci/2 = 0(^ 1/ 2)Aer_ 1/2 , ( 6 . 34 ) 



where 



*1 — 1/2 



AG/- 1/2 

A QUn 



(6.35) 



The index I here is used to represent the interface on the upwind side of jc,-— i/ 2 : 



i — 1 if u > 0, 
i + 1 if m < 0. 



(6.36) 



The ratio 9"_ 1 p can be thought of as a measure of the smoothness of the data near .r,_i/ 2 . 
Where the data is smooth we expect 0 "_ , /2 ~ 1 (except at extrema). Near a discontinuity 
we expect that 0"_ l p may be far from 1. 

The function (p(0) is the flux-limiter function, whose value depends on the smoothness. 
Setting f>{9) = 1 for all 0 gives the Lax-Wendroff method, while setting <p(Q) = 0 gives 
upwind. More generally we might want to devise a limiter function cp that has values near 
1 for 0^1, but that reduces (or perhaps increases) the slope where the data is not smooth. 

There are many other ways one might choose to measure the smoothness of the data 
besides the variable 6 defined in (6.35). However, the framework proposed above results in 
very simple formulas for the function (p corresponding to many standard methods, including 
all the methods discussed so far. 

In particular, note the nice feature that choosing 



0 ( 0 ) = 0 



(6.37) 



results in (6.34) becoming 

s, " i/2 = 

Hence this choice results in the jump at the interface upwind from x,_i/ 2 being used to 
define 8f_ l /. 1 instead of the jump at this interface. As a result, the method (6.32) with the 
choice of limiter (6.37) reduces to the Beam- Warming method. 

Since the centered difference (6.14) is the average of the one-sided slopes (6.15) and 
(6.16), we also find that Fromm’s method can be obtained by choosing 

0 ( 0 )= + 0 ). 



(6.38) 
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Also note that cp(0) = 2 corresponds to using = 2 A Q"_ l p, i.e., twice the jump at 
this interface, while <f>(0) = 26 results in using twice the jump at the upwind interface. Recall 
that these are necessary ingredients in some of the slope limiters discussed in Section 6.9. 
Translating the various slope limiters into flux-limiter functions, we obtain the functions 
found below for the methods previously introduced. 

Linear methods: 



upwind : 
Lax-Wendroff : 
Beam-Warming : 

Fromm : 



m = o, 
m = h 
m = o, 

<K0) = \(i + 0) 



High-resolution limiters: 



minmod : 
superbee : 
MC : 

van Leer : 



<p(0) = minmodd, 0), 

(f)(0) = max(0, mind, 20). min(2, 0)), 
(f)(0) = max(0, min((1 + Q)/2, 2, 20)) 

\e\ 



<i>m = 



i + i0i' 



(6.39a) 



(6.39b) 



The van Leer limiter listed here was proposed in [465]. A wide variety of other limiters 
have also been proposed in the literature. Many of these limiters are built into CLAWPACK. 
The parameter mthlim in clawlez (see Section 5.4.6) determines which limiter is used. 
Other limiters are easily added to the code by modifying the file claw/clawpack/ld/lib/ 
philim.f. 

The flux-limiter method has the flux (6.32) with gi ven by (6.34). Let v — u At/ Ax 
be the Courant number. Then the flux-limiter method takes the form 

Q " +1 = Q'!-AQi-QU) 

- lv(i - v)[0(0f + i/ 2 )(e' ! + i - Q-) - 4>{eu nW! ~ QU)\ (6.40) 

if u > 0, or 

q - +1 = q"-v(q'; + 1 -q';) 

+ l -v(\ + p)[0(0” +1/2 )(C" + i - Q") - 0(^1 /2 )(G" - Q1- 1 )] (6.41) 

if u < 0. 



6.12 TVD Limiters 

For simple limiters such as minmod, it is clear from the derivation as a slope limiter 
(Section 6.9) that the resulting method is TVD, since it is easy to check that (6.23) is 
satisfied. For more complicated limiters we would like to have an algebraic proof that the 
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resulting method is TVD. A fundamental tool in this direction is the following theorem 
of Harten [179], which can be used to derive explicit algebraic conditions on the function 
4> required for a TVD method. For some other discussions of TVD conditions, see [180], 
[349], [429], [435], [465], 

Theorem 6.1 (Harten). Consider a general method of the form 

Ql + 1 = Qi - ChiQ 1 ! - QU) + D i{Q"+i - Qi ) (6-42) 

over one time step, where the coefficients C"_ 1 and D " are arbitrary values (which in 
particular may depend on values of Q n in some way, i.e., the method may be nonlinear). 
Then 



TV(g" +1 ) < TV(Q") 

provided the following conditions are satisfied: 

c'U >o v/, 

Df > 0 V/, (6.43) 

C” + D? <1 V/. 

Note: the updating formula for Q" +1 uses C"_i and D'f but the last conditio n involves C" 
and D". 

For the proof see Exercise 8.5. We can apply this theorem to the flux-limiter method for 
q t + uq x — 0. We consider the case u > 0 here (see Exercise 6.7 for the case u < 0), so 
that the method has the form (6.40). There are many ways to rewrite this in the form (6.42), 
since C"_ 1 and D” are allowed to depend on Q" . The obvious choice is 

CU = v - \v{i - 

but this can’t be effectively used to prove the method is TVD, as there is no hope of satisfying 
the condition (6.43) using this. If 0 < v < 1 then D" < 0 when 0 is near 1. 

Instead note that 



Q ! + 1 - Ql = (QI - QU)K +l/1 , 

and so the formula (6.40) can be put into the form (6.42) with 



C'U = v + -v(l - v) 



'HW+ih) 



^(^-1/2) 



D" = 0. 



1+1/2 




6.12 TVD Limiters 



117 



The conditions (6.43) are then satisfied provided that 

0 < C’U < 1. 



This in turn holds provided that the CFL condition 0 < v < 1 holds, along with the bound 

I m) 



6i 



- <p(9 2 ) 



< 2 



for all values of 61 , 62 - 



(6.44) 



If 6 < 0, then we are at an extremum, and we know from our previous discussion that we 
should take <p( 6 ) — 0 in this case to achieve a TVD method. Also, when 6 > 0 we want 
(f)( 0 ) > 0, since it generally doesn’t make sense to negate the sign of the slope in applying 
the limiter. Since ()\ and (h in (6.44) are independent, we then see that we must require 



<b(6 ) 

0 < — < 2 

~ 6 ~ 



and 0 < <p( 6 ) < 2 



(6.45) 



for all values of 6 > 0 in order to guarantee that condition (6.44) is satisfied (along with 
< p{ 6 ) = 0 for 6 < 0). These constraints can be rewritten concisely as 



0 < <p(9) < minmod(2, 26). 



(6.46) 



This defines the TVD region in the 6 -(j) plane: the curve (f)( 6 ) must lie in this region, which 
is shown as the shaded region in Figure 6.6(a). This figure also shows the functions <f>( 6 ) 




Fig. 6.6. Limiter functions cj>(6). (a) The shaded regions shows where function values must lie for 
the method to be TVD. The second-order linear methods have functions tp(9) that leave this region, 
(b) The shaded region is the Sweby region of second-order TVD methods. The minmod limiter lies 
along the lower boundary, (c) The superbee limiter lies along the upper boundary. ( d ) The MC limiter 
is smooth at (f> = 1 . 
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from (6.39a) for the Lax-Wendroff, Beam-Warming, and Fromm methods. All of these 
functions lie outside the shaded region for some values of 6, and indeed these methods are 
not TVD. This graphical analysis of 0 was first presented by Sweby [429], who analyzed a 
wide class of flux-limiter methods (for nonlinear conservation laws as well as the advection 
equation). 

Note that for any second-order accurate method we must have 0(1)= 1. Sweby found, 
moreover, that it is best to take 0 to be a convex combination of 0 = 1 (Lax-Wendroff) 
and 0 = 0 (Beam- Warming). Other choices apparently give too much compression, and 
smooth data such as a sine wave tends to turn into a square wave as time evolves, as is 
already seen to happen with the superbee limiter. Imposing this additional restriction gives 
the second-order TVD region of Sweby, which is shown in Figure 6.6(b). 

The high-resolution limiter functions from (6.39b) are all seen to satisfy the constraints 
(6.46), and these limiters all give TVD methods. The functions 0 are graphed in Figure 6.6. 
Note that minmod lies along the lower boundary of the Sweby region, while superbee lies 
along the upper boundary. The fact that these functions are not smooth at 0 = 1 corresponds 
to the fact that there is a switch in the choice of one-sided approximation used as 0 crosses 
this point. For full second-order accuracy we would like the function 0 to be smooth near 
0 = 1, as for the MC limiter. The van Leer limiter is an even smoother version of this. 

We also generally want to impose a symmetry condition on the function 0(0). If the data 
Q" is symmetric in x, then we might expect the reconstructed piecewise linear function to 
have this same property. It can be shown (Exercise 6.8) that this requires that the function 
0 also satisfy 

0 ( 0 ) 

0(1/0) =^-A (6.47) 

U 

All of the high-resolution functions listed in (6.39b) satisfy this condition. 



6.13 High-Resolution Methods for Systems 

The slope-limiter or flux-limiter methods can also be extended to systems of equations. 
This is most easily done in the flux-limiter framework. First recall that the Lax-Wendroff 
method (6.4) can be written in flux-differencing form (4.4) if we define the flux by 

HQi- 1, Qi) = \A{Qi_ x + Qi) - - e,_i). (6.48) 

2 2 Ax 

Since A = A + + A~, we can rewrite this as 

HQi- i, Qi) = (A + e,„! +A-Q i )+^\A\(l-^\A\\(Q i - (6.49) 

where |A| = A + — A~ . 

In the form (6.49), we see that the Lax-Wendroff flux can be viewed as being composed 
of the upwind flux (4.56) plus a correction term, just as for the scalar advection equation. To 
define a flux-limiter method we must limit the magnitude of this correction term according 
to how the data is varying. But for a system of equations, A Q/- 1/2 = Qi — <2;_i is a vector, 
and it is not so clear how to compare this vector with the neighboring jump vector A Q/- 3/2 
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or AQ/+ 1/2 to generalize (6.34). It is also not clear which neighboring jump to consider, 
since the “upwind” direction is different for each eigencomponent. The solution, of course, 
is that we must decompose the correction term in (6.49) into eigencomponents and limit 
each scalar eigencoefficient separately, based on the algorithm for scalar advection. 

We can rewrite the correction term as 



1 

2 



IAI / 



At 

A^IKft 



c,-i) = -|A| 



'-f >)E«;v 



p = i 



where r p are the eigenvectors of A and the coefficients oti_ij 2 are defined by (4.38). The 
flux-limiter method is defined by replacing the scalar coefficient af_ l / 2 by a limited version, 
based on the scalar formulas of Section 6.11. We set 



where 



f-l /2 — a f- 1/2$ (^-1/2)' 



9 p 

■'i — 1/2 



*/- 1/2 



* 1 - 1/2 



with I 



! — 1 
i + 1 



if X p > 0, 

if x p < 0 , 



(6.50) 



(6.51) 



and 4> is one of the limiter functions of Section 6.1 1. The flux function for the flux-limiter 
method is then 



Fi- 1/2 — A + Qi-\ + A Qi + F,_i/2, (6.52) 

where the first term is the upwind flux and the correction flux Fi- 1/2 is defined by 

1 / At \ m 

Fi- 1/2 = -|A| ( i - — |A| 1 (6-53) 

' ' p= 1 

Note that in the case of a scalar equation, we can take /- 1 = 1 as the eigenvector of A — u, so 
that A<2 ,_i/2 = a j_i/ 2 , which is what we called 5,- — 1/2 in Section 6.1 1. The formula (6.52) 
then reduces to (6.32). Also note that the flux T\_i/ 2 (and hence Z 7 / — 1 / 2 ) depends not only 
on Qi-i and Q,, but also on Qi-i and Qi+\ in general, because neighboring jumps are 
used in defining the limited values af_ 1 / 2 in (6.50). The flux-limiter method thus has a five- 
point stencil rather than the three-point stencil of the Lax-Wendroff. This is particularly 
important in specifying boundary conditions (see Chapter 7). Note that this widening of the 
stencil gives a relaxation of the CFL restriction on the time step. For a five-point method 
the CFL condition requires only that the Courant number be less than 2. However, the CFL 
condition gives only a necessary condition on stability, and in fact these high-resolution 
methods are generally not stable for Courant numbers between 1 and 2. The larger stencil 
does not lead to a greater stability limit because the additional information is used only to 
limit the second-order correction terms. 

Note that \A\r p = \X p \r p , so that (6.53) may be rewritten as 



1 m / 

^■-i/2= 9 El^l ! - 



At 

Ax 



1^1 



<y p r p 
a i- 1 / 2 ' • 



(6.54) 
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This method for a system of equations can also be viewed as a slope-limiter method, if 
we think of (Qi — gf-t)/A.r as approximating the slope vector, with each element giving 
an approximation to the slope for the corresponding element of q in a piecewise linear 
reconstruction. One might be tempted to limit the slope for each element of the vector 
separately, but in fact it is much better to proceed as above and limit the eigencoefficients 
obtained when the slope vector is expanded in eigenvectors of A. As a result, each wave is 
limited independently of other families, and the accuracy of smooth waves is not adversely 
affected by limiters being applied to other waves that may not be smooth. 

For these wave-propagation algorithms it is perhaps most natural to view the limiter 
function as being a wave limiter rather than a slope limiter or flux limiter, since it is really the 
individual waves that are being limited after solving the Riemann problem. This viewpoint 
carries over naturally to nonlinear systems as introduced in Section 6.15 and explored more 
fully in Chapter 15. In the more general context, it is natural to use the notation 



w,-t/2 = eti 



i- 1/2' 



(6.55) 



to denote the limited wave appearing in (6.54). 



6.14 Implementation 

For the constant-coefficient linear system, we could compute the matrices A + , A~, and A 
once and for all and compute the fluxes directly from the formulas given above. However, 
with limiters we must solve the Riemann problem at each interface to obtain a decomposition 
of A Qi —\/2 into waves aj , _ 1 pr p and wave speeds X p , and these can be used directly in the 
computation of Q" +l without ever forming the matrices. This approach also generalizes 
directly to nonlinear systems of conservation laws, where we do not have a single matrix 
A , but can still solve a Riemann problem at each interface for waves and wave speeds. This 
generalization is discussed briefly in the next section. 

To accomplish this most easily, note that if we use the flux (6.52) in the flux-differencing 
formula (4.4) and then rearrange the upwind terms, we can write the formula for g" +1 as 



<2' !+1 = Qi - ^(A + Ag,-i/2 + A-AQ i+l/2 ) - ^(F i+ i/2 - F ;- 1/2 ), 



where we drop the superscript n from the current time step because we will need to use 
superscript p below to denote the wave family. Each of the terms in this expression can be 
written in terms of the waves V\lj’_ l / 2 = <xf_ \pr p and wave speeds X p : 



m 

A+AQi- l/2 = J>0 + Wf_ 1/2 , 
p= i 
m 

A-AQ i ^ /2 = Y J {X p )-W?_ l/2 , 



p = 1 



p= 1 x 7 



Wf- 1 / 2 - 



(6.56) 
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6.15 Extension to Nonlinear Systems 

The full extension of these methods to nonlinear problems will be discussed in Chapter 15 
after developing the theory of nonlinear equations. However, the main idea is easily ex- 
plained as a simple extension of what has been done for linear systems. Given states <2,_ i and 
Qi, the solution to the Riemann problem will be seen to yield a set of waves W/l 1//2 € R m 
and speeds s^_^ 2 e R, analogous to the linear problem, though now the speeds will vary 
with i, and so will the directions of the vectors W/Lj /2 i n phase space; they will no longer 
all be scalar multiples of a single set of eigenvectors r p . 

The quantities 4 4 A and A~ A<2;_i / 2 have been generalized to fluctuations in 

Chapter 4, denoted by A + A(2 ,_i /2 and A~ A 2,- 1/2, with the property that 



A~AQi_ 1/2 + ^Ae M /2 = f(Qi) - f(Qi- 1 ). (6.57) 

Note that for the linear case f{Q ,) — /(<2/_i) = A A 0,_ 1/2, and this property is satisfied. 
In general we can think of setting 



-4 -A G ,-- 1/2 = E( s f- 1/2 )' W ' I/2 - 

P= 1 
m 

A + AQ l _ l/2 = '£ i (s?_ 1/2 ) + Wf_ l/2 , 

p = 1 



(6.58) 



a direct extension of (6.56). There are, however, some issues concerned with rarefaction 
waves and entropy conditions that make the nonlinear case more subtle, and the proper 
specification of these flux differences will be discussed later. Once the waves, speeds, and 
flux differences have been suitably defined, the algorithm is virtually identical with what 
has already been defined in the linear case. We set 

Q'! +1 = Qi ~ ^{A~AQ i+1/2 + A + AQi_ l/2 ) - ^{F i+ 1/2 - E,- 1/2 ), (6.59) 

where 

Fi- i/2 =^E Ki / 2 | (l - X7K1/2I) W/V (6.60) 

p= 1 ^ ' 



Here Wj , _ l / 2 represents a limited version of the wave Wj , _ l ^ 2 , obtained by comparing this 
jump with the jump W ?_ , /2 in the same family at the neighboring Riemann problem in the 
upwind direction, so 



ji -1 if.st, /2 > 0, 
[i + 1 if^_ 1/2 <0. 



(6.61) 



This limiting procedure is slightly more complicated than in the constant-coefficient case, 
in that W’ 1 ^i/ 2 and Wf_ 1//2 are in general no longer scalar multiples of the same vector 
r p . So we cannot simply apply the scalar limiter function <p(9) to the ratio of these scalar 
coefficients as in the constant-coefficient linear case. Instead we can, for example, project 




122 



6 High-Resolution Methods 



the vector Wf_]y 2 onto and compare the length of this projection with the length 

ofW,-i /2 . This same issue arises for variable-coefficient linear systems and is discussed in 
Section 9.13. 

When no limiting is used, this method is formally second-order accurate provided certain 
conditions are satisfied by the Riemann solution used; see Section 15.6. The methods 
generally perform much better, however, when limiters are applied. 



6.16 Capacity-Form Differencing 

In many applications the system of conservation laws to be solved most naturally takes the 
form 



K(x)q, + f (q) x = 0, 



(6.62) 



where k(x) is a spatially varying capacity function as introduced in Section 2.4. In the 
remainder of this chapter we will see how to extend the high-resolution methods defined 
above to this situation. In particular, this allows us to extend the methods to nonuniform 
grids as we do in Section 6.17. This material can be skipped without loss of continuity. 

The equation (6.62) generally arises from an integral conservation law of the form 



d 

dt 




K(x)q(x, t)dx = f(q(x i, t)) - f(q(x 2 , t)), 



(6.63) 



in which x(x)q(x, t) is the conserved quantity while the flux at a point depends most 
naturally on the value of q itself. It may then be most natural to solve Riemann problems 
corresponding to the flux function f(q), i.e., by solving the equation 



q, + f(q)x = 0 



(6.64) 



locally at each cell interface, and then use the resulting wave structure to update the solution 
to (6.62). This suggests a method of the form 

X, G ” +1 = X, Q" - Q’i+i) - HQi-v Qi)l (6.65) 

Dividing by /q gives 

Q '! +1 = Qi ~ Q'!+ i ) - HQi-u Q1)\ ( 6 - 66 ) 

The form of equation (6.66) will be called capacity-form differencing. It is a generalization of 
conservation-form differencing to include the capacity function. The factor /e, Ax appearing 
in (6.66) has a natural interpretation as the effective volume of the i th grid cell if we think of 
Kj as a fraction of the volume available to the fluid (as in porous-media flow, for example). 
Note that the updating formula (6.66) has the advantage of being conservative in the proper 
manner for the conservation law (6.63), since computing 

A a- ^2 K i Q'l +1 = Ax ^2 K i Q" + boundary fluxes 

i i 
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from (6.65) shows that all the fluxes cancel except at the boundaries, where the boundary 
conditions must come into play. 

We can rewrite the method (6.66) in the framework of the high-resolution wave- 
propagation algorithm as 



Q1 +1 = Q'\ 



At 

Ki Ax 



(A + aqu /2 + a-aqi +1/2 ) 



At 
Ki Ax 



(Fi+W-Fi-w)' ( 6 - 67 ) 



where as usual A^ AQi- 1/2 are the fluctuations and F,-_ 1/2 is the correction flux based on 
the Riemann solution for (6.64) at the interface x,_ 1 / 2 . We now use the correction flux 



Fi-l/2 = 



p= 1 



At 

Ki- 1/2 Ax 



1 » / 


i LP I 




| r /—1/2 I 



Wf-1,2- 



(6.68) 



This is the general form for a system of m equations with M w waves, as introduced in 
Section 6.15. The wave W] , _ l / 2 is a limited version of the wave 2 , just as before, and 
these waves again are obtained by solving the Riemann problem for (6.64), ignoring the 
capacity function. The only modification from the formula for F j_\/ 2 given in (6.60) is that 
Ax is replaced by /c,_ 1/2 Ax, where 



1 

Ki- 1/2 = -0,-1 + Ki). (6.69) 

Clearly, replacing Ax by some form of k Ax is necessary on the basis of dimensional 
arguments. If k ix ) is smoothly varying, then using either /c,_i Ax or at,- Ax would also 
work, and the resulting method is second-order accurate for smooth solutions with any of 
these choices. 

The choice at, — 1/2 seems most reasonable, however, since the flux F ,-\/ 2 is associated 
with the interface between cells C,_i and C, , rather than with either of the two cells. This 
term in the correction flux gives an approximation to the ^ A t 2 q,, term in the Taylor series 
expansion once it is inserted in the flux-differencing term of (6.67). We compute, for smooth 
solutions, that 



= ~ f -[/'(?)] V) , 

K \ K )x 

and centering of these terms again suggests that the inner 1 //c should be evaluated at x,± i/ 2 - 
In Section 6.17.1, where nonuniform grids are discussed, we will see that this choice makes 
clear physical sense in that case, and is correct even if k is not smoothly varying. 



6.17 Nonuniform Grids 

One natural example of a capacity function arises if we solve a standard conservation law 
q, + f(q) x = 0 on a nonuniform grid such as shown on the right of Figure 6.7. There are 
two philosophically different approaches that could be taken to derive a numerical method 
on this grid: 
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^n+1 



^n+1 



A(0 



6 - 1/2 6 + 1/2 




%i- 1/2 ®t+l/2 



Fig. 6.7. The grid mapping //(f) maps a uniform computational grid in f-f space (on the left) to the 
nonuniform x-t grid in physical space (on the right). 



1 . Work directly in the physical space (the right side of Figure 6 . 7 ), and derive a finite 
volume method on the nonuniform gird for the integral form of the conservation law on 
the physical-grid cells. 

2 . View the nonuniform grid as resulting from some coordinate mapping applied to a uni- 
form grid in computational space. Such a grid is illustrated on the left side of Figure 6 . 7 , 
where Af is constant and the mapping function X(f ) defines jc,- — 1/2 = X(6 - 1/2)- If we 
can transform the equation in x and t to an equivalent equation in f and r, then we can 
solve the transformed equation on the uniform grid. 

The first approach is generally easier to use in developing finite volume methods for 
conservation laws, for several reasons: 

• The transformed equation in computational space includes metric terms involving X'(f ), 
as we will see below. The mapping must be smooth in order to achieve good accuracy if 
we attempt to discretize the transformed equations directly. In practice we might want to 
use a highly nonuniform grid corresponding to a nonsmooth function A(|). 

• Even if X(§) is smooth, it may not be easy to discretize the transformed equation in § 
in a way that exactly conserves the correct physical quantities in x. By contrast, a finite 
volume method in the physical space automatically achieves this if we write it in terms 
of fluxes at the cell edges. 

• Using a Godunov-type method requires solving a Riemann problem at each cell interface, 
and the transformed equations lead to a transformed Riemann problem. It is often simpler 
to solve the original, physically meaningful Riemann problem in x. 

For these reasons we will derive finite volume methods in the physical domain. However, 
once the method has been derived, we will see that we can then view it as a discretization 
of the transformed equation. This viewpoint is useful in implementing the methods, as we 
can then express the method as a finite volume method with a capacity function applied 
on the uniform f-grid. As we will see, the capacity k ,■ of the zth grid cell [6-1/2, 6+1/2] is 
then the ratio of the physical cell length A jc, = (x,+i/2 — x,-_ 1/2) to A£, which is a natural 
measure of the capacity of the computational cell. For a smooth mapping X($) this can 
be viewed as an approximation to the capacity function k(x ) = A'(f ), the Jacobian of the 
mapping, but the finite volume method remains valid and accurate even if the mapping is 
not smooth. 

In Chapter 23 , we will see that this approach extends naturally to quadrilateral grids 
in more than one space dimension. Again we will derive finite volume methods in the 
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irregular physical grid cells, but implement the methods using capacity-form differencing 
on a uniform rectangular grid. 

We now derive a finite volume method in the physical domain. We have 



r x i+ 

J Xi -\, 



q(x, 1 n+ \)dx — 



q{x, t n )dx 



r*i+ 

Jxi-i,- 

a <n + 1 rt„+i \ 

f{q{xi+ 1 / 2 , t)) dt - J f(q{xi- 1 /2 , t)) dtj, 



(6.70) 



which suggests the finite volume method 

e; ,+1 = Q'i - e:Vi) - hqu, q'DI 

where Ax,- = x, + i /2 — X/- 1/2 is the width of the ith cell and 



(6.71) 



Qi 



1 



f 



•/*-!/* 



g(x, t„)dx. 



The numerical flux T( Q " _ , . Q") should be, as usual, an approximation to 



1 

At 




f(q(xi- 1/2 ,t))dt. 



For Godunov’s method, this is determined simply by solving the Riemann problem and 
evaluating the flux along x/t — 0. The fact that the grid is nonuniform is immaterial in 
computing the Godunov flux. The nonuniformity of the grid does come into the second- 
order correction terms, however, since approximating the slopes q x with cell values requires 
paying attention to the grid spacing. This is discussed in Section 6.17.1. 

Next we will see that the method (6.71) can be reinterpreted as a method on the uniform 
computational grid. Let 



Ki - Axi/Af , 

where A£ is the uniform cell size in the computational grid shown on the left in Figure 6.7. 
Then the method (6.71) can be written as 

Qi +1 = Q! - ~^rz IHQl Qi + 1 ) - HQU’ Ql)l (6.72) 

K i 

Let 



q(£, t) = q(X(f), t ). 



If we now assume that the coordinate mapping X(f) is differentiable, then the change of 
variables x = X(f), for which dx = X'(§)</§, gives 




L 

L 



ti-1/2 

&+1/2 



q{X^),t)X'{^)dH 

q{^t)K(f)d^ 
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where the capacity function k(^) is the Jacobian X'(^), as expected. Hence the conservation 
law 



d r x i- 1 - 1/2 

— / q(x, t)dx = t)) - f(q(x i+l/2 , t)) 

Clt Jti- 1/2 

is transformed into 

d /*^'+V2 

— / <?(£> t)K(Z)d% = /(<?(£, -1/2, 0) _ /(<?(ti+l/2, ())■ 

dt hi- 1/2 

This has the form of the integral conservation law (6.63) in the computational domain. The 
finite volume method (6.71), when rewritten as (6.72), can be viewed as a capacity-form 
differencing method on the uniform f -grid. 

We can also interpret Q” as an approximation to a cell average of q over the computational 
domain: 



Q" ~ ^7 / q(x, t n )dx 



— r 

^Xi J x ._ 

-M 

r 

hi-M 



- 1/2 
fi+1/2 



q(h t n )K(h)d% 



j rh+ 1/2 

— / q(£,t n )d%. 

Hi— 1/2 



(6.73) 



6.17.1 High-Resolution Corrections 

One way to derive high-resolution methods for the advection equation q, + uq x = 0 (with 
u > 0, say) on a nonuniform grid is to use the REA algorithm approach from Section 6.4, 
which is easily extended to nonuniform grids. Given cell averages, we reconstruct a piece- 
wise linear function on each of the grid cells, evolve the advection equation exactly by 
shifting this function over a distance m At, and average onto the grid cells to obtain new cell 
averages. If the slopes are all chosen to be er" = 0, then this reduces to the upwind method 

Q- +1 = Q" - Q" - G7-0. 

K i 

which has the form (6.72) with IF(Q'‘_ \ , 6") = u Q "_ , . With nonzero slopes we obtain 



QV = Q'i - 



u At 
Ki 



(Qi - QU) 



~ ^ - « Ar ) a " - (ti - 1 A? - m A()<j]. (6.74) 

2 Ki A£ 

Recall that on a uniform grid the Lax-Wendroff method is obtained by taking slopes 
<y"_\ =(Q" ~ Q'I- i)/Ax. On a nonuniform grid the distance between cell centers is 
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Ki- 1/2 A£, where at,- —1/2 = 5(^-1 + k,), and the natural generalization of the Lax-Wendroff 
method is obtained by setting 



1 



Q" ~ QU 

Ki- 1/2 A§ 



This corresponds to a correction flux 



(6.75) 



Fi-l/2 



1 / «V-i 

2 \ Ki— 1/2 



u At \ 
— 1/2 A§ ) 



HQi-Qi- 1 )- 



(6.76) 



The natural generalization of this to systems of equations gives a correction flux similar to 
(6.68) but with the 1 replaced by k ,■ /at,_ 1/2: 
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(6.77) 



For smooth a: this is essentially the same as (6.68), but for nonsmooth k (6.77) might give 
better accuracy. 



Exercises 

6.1. Verify that (6.13) results from integrating the piecewise linear solution q n (x, t„ + 1). 

6.2. Compute the total variation of the functions 
(a) 



(b) 



q(x) = 



1 if x < 0, 
sin(7rx) if 0 < x < 3, 

2 if x > 3, 



1 

q(x) = 



2 



if x < 0 orx = 3, 

if 0 < x < 1 or 2 < x < 3, 

if 1 < x < 2, 

ifx > 3. 



6.3. Show that any TVD method is monotonicity-preserving. (But note that the converse 
is not necessarily true: a monotonicity-preserving method may not be TVD on more 
general data.) 

6.4. Show that (6.25) is valid by showing that, for any function qix), if we define discrete 
values Qi by averaging q(x) over grid cells, then TV ( Q ) < TW(q). Hint: Use the 
definition (6.19) and the fact that the average value of q lies between the maximum 
and minimum values on each grid cell. 

6.5. Show that the minmod slope guarantees that (6.23) will be satisfied in general, and 
hence the minmod method is TVD. 
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6.6. Show that taking 

= Q" - Q -- 1 

in (6.32) corresponds to using the downwind slope for o in both cases u > 0 and 
u < 0, and that the resulting flux gives the Lax-Wendroff method. 

6.7. Show that if u < 0 we can apply Theorem 6.1 to the flux-limiter method (6.41) by 
choosing 

cl i = o, 

D 'i = ~ v + 2 v(l + v ' ) 1 / 2 ) ^ ) 

in order to show that the method is TVD provided — 1 < v < 0 and the bound (6.44) 
holds. Hence the same restrictions on limiter functions are found in this case as 
discussed in Section 6.12. 

6.8. Verify that (6.47) is required for symmetry. 

6.9. Verify that (6.48) and (6.49) are equivalent and yield the Lax-Wendroff method. 

6.10. Plot the van Leer limiter from (6.39b), and verify that it lies in the Sweby region 
shown in Figure 6.6. 




■ 7 

Boundary Conditions and Ghost Cells 



So far we have only studied methods for updating the cell average Q'J assuming that we 
have neighboring cell values Q"_\ and Q" \ and perhaps values further away as needed in 
order to compute the fluxes F"_y 2 an d C+i/ 2 - practice we must always compute on some 
finite set of grid cells covering a bounded domain, and in the first and last cells we will not 
have the required neighboring information. Instead we have some set of physical boundary 
conditions, as discussed in Section 3.11, that must be used in updating these cell values. 

One approach is to develop special formulas for use near the boundaries, which will 
depend both on what type of boundary conditions are specified and on what sort of method 
we are trying to match. However, in general it is much easier to think of extending the 
computational domain to include a few additional cells on either end, called ghost cells, 
whose values are set at the beginning of each time step in some manner that depends on the 
boundary conditions and perhaps the interior solution. 

Figure 7.1 shows a grid with two ghost cells at each boundary. These values provide 
the neighboring-cell values needed in updating the cells near the physical domain. The 
updating formula is then exactly the same in all cells, and there is no need to develop 
a special flux-limiter method, say, that works with boundary data instead of initial data. 
Instead the boundary conditions must be used in deciding how to set the values of the ghost 
cells, but this can generally be done in a way that depends only on the boundary conditions 
and is decoupled entirely from the choice of numerical method that is then applied. 

Suppose the problem is on the physical domain [ a, b ], which is subdivided into cells 
C\,C 2 , ■ ■ ■ , Cjv with x\—a and xn + i = b, so that A.r = (b — a)/N. If we use a method for 
which F"_ 1/2 depends only on Q"_ l and Q" , then we need only one ghost cell on either end. 
The ghost cell C 0 = (a — Ax , a) allows us to calculate the flux F\/ 2 at the left boundary 
while the ghost cell C^ + \ —(b,b + Ax) is used to calculate F^ +l j 2 at x = b. With a flux- 
limiter method of the type developed in Chapter 6, we will generally need two ghost cells at 
each boundary, since, for example, the jump Qq — Q_ \ will be needed in limiting the flux 
correction in F\/ 2 . For a method with an even wider stencil, additional ghost cells would be 
needed. In general we might have m BC ghost cells at each side. 

We will refer to the solution in the original domain [a, b] as the interior solution ; it is 
computed in each time step by the numerical method. At the start of each time step we have 
the interior values Q", , Q n N obtained from the previous time step (or from the initial 
conditions if n = 0), and we apply a boundary-condition procedure to fill the ghost cells 
with values Q" for i = 1 — m BC , . . . , 0 and i = N + 1, . . . , N + m BC before applying the 
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Q- 1 


Qo 


Qi 


Q2 




Qn 


Qn+1 


Qn + 2 



















X l/2 X N+ i/ 2 

x = a x = b 



Fig. 7.1. The computational domain [a, h] is extended to a set of ghost cells for specifying boundary 
conditions. 

method on the next time step. In CLAWPACK this is done in a subroutine bcl . f that is called 
at the beginning of each time step. The default routine claw/clawpack/ld/lib/bcl 
implements most of the particular boundary conditions discussed in this chapter. We will 
look at several examples to see how the ghost-cell values might be set in order to implement 
various physical boundary conditions. In general we will discuss the boundary conditions 
for the case m BC = 2, but they can easily be extended to larger values if necessary. 



7.1 Periodic Boundary Conditions 

Periodic boundary conditions q(a, t) — q(b, t ) are very easy to impose with any numerical 
method. In updating g | we need values go to the left and g 2 to the right (for a three-point 
method). By periodicity the value go should agree with the value Q N in the last cell. One 
could code the formula for updating gi specially to use g v in place of the value Q,-i 
that would normally be used for i > 1, but it is simpler to use the ghost-cell approach and 
simply set Q{\ = Q" N before computing fluxes and updating the cell values, so that the same 
formula can then be used everywhere. With a five-point stencil we need to fill two ghost 
cells at each boundary, and we set 

g-t = g*-t, Go = Q"v’ Q"n +1 = Ql’ Qn + 2 = Qi (7.1) 

at the start of each time step. These boundary conditions are implemented in the 
CLAWPACK routine claw/clawpack/ld/lib/bcl . f and invoked by setting mthbc (1) = 2 
and mthbc (2) = 2. 



7.2 Advection 

Consider the advection equation on [a, b ] with u > 0 and the boundary condition 

q(a, t) = go(0, (7.2) 

where go(0 is a given function. Recall from Section 3.11 that we cannot specify a boundary 
condition at x — b. We may need to specify a boundary condition at x = b for the numerical 
method, however, if the stencil for computing g" +1 involves points to the right of Xj. 



7.2.1 Outflow Boundaries 

First we consider the outflow boundary at x = b. If we use a one-sided method such as 
upwind or Beam-Warming, then we do not need any numerical boundary condition. If we 
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implement the method using ghost cells, then we can assign arbitrary values to the ghost 
cells on the right with no effect on the interior solution, since these values will never be used. 

If we use a three-point method such as Lax-Wendroff that does use values to the right, 
then some numerical boundary conditions must be specified. We must in general take some 
care in how we specify these to obtain a stable and accurate method. 

One possibility is to use a fully upwind method at the rightmost point, together with the 
Lax-Wendroff method at all other points. This works quite well. Note, however, that the 
Lax-Wendroff method allows information to propagate from right to left, even though 
the exact solution to the advection equation does not. So there is the possibility that noise 
generated at the right boundary by this switch in method will propagate back into the domain 
and contaminate the solution elsewhere, and perhaps even cause an instability. The Lax- 
Wendroff method is highly dissipative for left-going waves, and so this does not cause a 
noticeable problem, and one can prove that the resulting method is stable. Indeed, a similar 
change in method occurs frequently with flux-limiter methods, with only slight loss in 
accuracy. 

In general, the theory of stability for the IB VP is much more subtle and difficult than for 
the Cauchy problem. See, for example, [427], [459] for an introduction to this topic. 

Rather than explicitly switching to a different formula at the right boundary, we can 
achieve the same effect by setting ghost cell values by extrapolation from the interior 
solution. If the ghost-cell value Q" N+l is set based on Q n N , Q%_ 1 , ■ ■ . , then the new value 
Q'jfi 1 will effectively be computed on the basis of values to the left alone, even if the formula 
depends on Q" N+1 , and hence this reduces to some sort of upwind method. The simplest 
approach is to use a zero-order extrapolation, meaning extrapolation by a constant function. 
We simply set 



Qn +1 = Qn, Q'n+2 = Q'n (7-3) 

at the start of each time step. Then we have A Q n N+i — 0 as the value & n N+l / 2 used in the 
flux modification to F^ + y 2 in (6.32). This flux then reduces to the upwind flux, 

K + X,2 = UQT N , 

since u > 0. Note that the method may not reduce to the standard first-order upwind method 
in the last cell, however, since ( = A Q n N _y 2 — Q'n Q'n- i f° r the Lax-Wendroff 

method) may be nonzero. But the resulting method behaves in roughly the same manner. 

One might also consider first-order extrapolation, based on fitting a linear function 
through the interior solution. This gives 

Q'n+i = Qn + (Q'n - Q'n- i) = 2 Q" N - Q'n- i- (7-4) 

In this case A Q' l N+1 j 2 — A Q n N _y 2 and the correction terms in F£_ y 2 and F^ + X / 2 will cancel 
out (assuming the limiter function satisfies 0{\) — 1). Now the update for Q n n +l does reduce 
to the standard first-order upwind method. 

The idea of extrapolation at outflow boundaries turns out to be extremely powerful more 
generally, even for systems of equations where there are both incoming and outgoing charac- 
teristics. Often we have artificial computational boundaries that arise simply because we can 
only solve the problem on a bounded domain. At such boundaries we often want to have no 
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incoming signal, though there may be outgoing waves that should leave the domain cleanly 
without generating spurious reflections at the artificial boundary. Zero-order extrapolation, 
coupled with the methods we are studying that perform characteristic decomposition in the 
solution to each Riemann problem, is often a very effective way to accomplish this. This is 
discussed in Section 7.3.1 for linear acoustics and will be investigated later for nonlinear 
and multidimensional problems. First-order extrapolation might seem even better, but can 
lead to stability problems and is not recommended in general. 

Zero-order extrapolation boundary conditions are implemented in CLAWPACK as one 
of the options that can be automatically invoked when using clawlez as described in 
Chapter 5. The ghost-cell values are set in claw/clawpack/ld/lib/bcl .f and invoked 
by setting mthbc (i) = 1, where i=l for the left boundary or i=2 for the right boundary. 
(See Section 5.4.6.) 



7.2.2 Inflow Boundary Conditions 

Now consider the inflow boundary at x — a for the advection equation, where we specify 
the boundary condition (7.2). One approach would be to compute the exact flux F'/p by 
integrating along the boundary, 



1 f'" + ' _ 

F|" 2 = — J uq(a, t)dt 

I* tn + 1 

= — J go(t)dt, (7.5) 

and use this in the flux-differencing formula for Q" + 1 . Alternatively, a second-order accurate 
approximation such as 



Fy 2 = ugo(t„ + At/2 ) 



(7.6) 



could be used instead. 

Note that for a five-point method we also need a special formula for the flux F", 2 . which 
might be more difficult to compute by this approach. 

Again, for generality it is often simpler instead to find a way to set the ghost-cell values 
Q 1 / (and perhaps Q n _ x ) so that the same interior method can be used at all points. For the 
advection equation we can easily compute sensible values using our knowledge of the exact 
solution. We would like to set 



i r 

ai =^L 



q(x, t„)dx. 



Of course the true solution isn’ t even defined for x < a , but we can easily extend the solution 
past the boundary using our knowledge of characteristics, setting 



q(x, t„) = t n + 

= goft/1 + 



a — x 



u 

a — x 
u 



(7.7) 
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Then 



0"- — 
~ Ax 



f 

u ft' 

AX Jtn 



gol tn + 



u 



fa— Ax 
n t n +Ax/u 

— I go(r)dr. 



dx 



(7.8) 



Again we could approximate this integral by the second-order midpoint approximation, 
obtaining 



Q"o = go(t„ + ^\ (7.9) 

which is also what we would get if we simply evaluated the cell-center value q(a — Ax /2, t n ) 
using (7.7). Similarly, for the second ghost cell we could set 

0 “ 1 = gO ( ? ' , + ^)- 

Specifying such boundary conditions in CLAWPACK requires modifying the bcl routine 
to set the ghost-cell values appropriately. See [claw/book/chap7/advinf low] for an 
example. 



7.3 Acoustics 

Similar ideas can be applied to develop boundary-condition procedures for systems of 
equations. The situation may be complicated by the fact that the system can have both 
positive and negative eigenvalues, so that each boundary will typically have both incoming 
and outgoing characteristics. Other sorts of physical boundary conditions, such as a solid 
wall where waves are expected to reflect, must also be studied. Many of these issues can be 
illustrated for the simple case of linear acoustics as developed in Section 2.7, a linear system 
of two equations with one eigenvalue of each sign. The boundary-condition procedures 
developed here will later be seen to extend very easily to nonlinear systems such as the full 
Euler equations of compressible flow. 

We consider the acoustics equations from (2.52), 



Pt + K 0 u x = 0 
A) Mr + Px = 0, 



(7.10) 



with a variety of different boundary conditions. The characteristic variables for this system 
are 



U)\x, t ) = — — (-p + Z 0 u ), 

ZZo 



w 2 (x,t)= — — ( p + z 0 u ), 

ZZo 



(7.11) 



where Zq is the impedance, as derived in Section 2.8. 
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7.3.1 Nonreflecting Boundary Conditions 

Suppose we wish to solve the Cauchy problem with initial data u(x ) and p(x) that vary with 
x only in some region a\ < x < b i, and are constant outside this interval, say 

, , (. Pl,u l ) if x < a\, 

(P, u) = 

(Pr.ur) if x > b\. 



We know from Chapter 3 that the solution to the Cauchy problem will consist of two waves, 
each with fixed shape depending on u and p. One propagates to the left with speed — cq 
and the other to the right with speed cq. Eventually these waves will separate completely 
from one another (certainly for times t > {b\ — ci\)/cq) and leave a new constant state in 
between. Note that the characteristic variables (7.11) satisfy 



w~(ai,t)— ——(pl + Z 0 u l ) for all f > 0, 
2Zo 



w\b u t) — ——(~p R + Zqu r ) for all t >0. 
ZZq 



If we now want to compute this solution numerically, we must choose some finite domain 
a < x < b on which to perform the computation, and we will suppose this domain includes 
the interval where the data is specified, i.e., a < a\ <b\ < h. For short times the solution 
should remain constant at the boundaries a and b, but eventually the acoustic waves should 
pass out through these boundaries. If we want to compute over longer times, then we must 
specify the boundary conditions in such a way that the waves leave the region cleanly, 
without generating any signal in the incoming characteristic variable propagating back into 
the computational domain. 

The points x — a and x = b are artificial boundaries, in the sense that they are only intro- 
duced to make the computational domain finite, and do not correspond to physical bound- 
aries. Hence we want to impose nonreflecting boundary conditions at these boundaries that 
do not introduce spurious reflections of the outgoing acoustic waves. Such boundary condi- 
tions are also called absorbing boundary conditions, since they are supposed to completely 
absorb any wave that hits them. Such boundary conditions are extremely important in many 
practical applications. We often wish to model what is happening in a small portion of 
physical space that must be truncated artificially at some point. Consider the problem of 
modeling seismic waves in the earth, or flow around an airplane. We cannot hope to model 
the entire earth or atmosphere, nor should we need to in order to understand localized phe- 
nomena. We want to cut off the domain far enough away that we will have a good model, 
but as close in as possible to reduce the size of the computational domain and the computing 
time required. Specifying good absorbing boundary conditions is often crucial in obtaining 
useful results. A variety of sophisticated approaches have been developed for specifying 
appropriate numerical boundary conditions. See Section 21.8.5 for some references and 
further discussion. 

With Godunov-type methods that involve solving the Riemann problem at each inter- 
face, it turns out that simply using zero-order extrapolation often gives a reasonable set 
of absorbing boundary conditions that is extremely simple to implement. For the simple 
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one-dimensional acoustics problem, we can analyze this choice completely using the char- 
acteristic decomposition. 

In fact, one approach to solving the problem for this linear system would be to diagonalize 
the equations, obtaining scalar advection equations for the characteristic variables w 1 and ur 
from (7.11). The numerical method can also be diagonalized and yields updating formulas 
for W ] — (—Q 1 + ZqQ 2 )/2Zq and W 2 = (Q l + ZqQ 2 )/2Zq. At the boundary x = a, 
for example, the variable w 1 is the outgoing variable and we already know that zero-order 
extrapolation can be used for this variable from Section 7.2.1. For the incoming variable 
wr we want to set the value as a function go(t), following Section 7.2.2. Since we want 
to insure that no signal flows into the domain, regardless of what is flowing out in w l , the 
correct value to set is 

2 1 

w (a, t) = — — (p L + Z 0 u L ) 

ZZo 

for all t (recall (7.12)). Following Section 7.2.1 this would suggest setting both the ghost 
cell values to this value, 

W 0 2 = ^-{pl + Z 0 u L ), Wl t = -U p L + Zqu l ), (7.13) 

ZZo Z^o 

together with extrapolation of IF 1 . From these ghost-cell values for W we could then 
compute the required ghost-cell values for Q. 

However, since W 2 is already constant near the point i = fl,we expect that W 2 — ( pi + 
Z{)U / )/2Zo, and so the boundary conditions (7.13) can also be obtained by simply using 
zero-order extrapolation for W 2 as well as for W 1 . But now, if we extrapolate both of the 
characteristic variables and then compute Qo and Q-i, we will find that these values are 
simply what would be obtained by zero-order extrapolation of Q. So we do not need to go 
through the diagonalization at all in setting the boundary conditions, but can simply set 

Q'o=Q ", 2_i = Ql 



in each time step. 

Note that by setting Qo — Q\ we insure that the solution to the Riemann problem at the 
interface X \ /2 consists of no waves, or more properly that the wave strengths are all 
zero. So in particular there are no waves generated at the boundary regardless of what is 
happening in the interior, as desired for nonreflecting boundary conditions. 

There are also no outgoing waves generated at jci/ 2 , which may be incorrect if an acoustic 
wave in w 1 should in fact be leaving the domain. But any outgoing waves would only be 
used to update the solution in the ghost cell Co, and it is not important that we update this 
cell properly. In fact, the value Qo is not updated at all by the interior algorithm. Instead 
this value is reset by extrapolation again at the start of the next time step. 



7.3.2 Incoming Waves 

In the previous section we assumed that we wanted incoming waves to have zero strength. 
In some problems we may need to specify some incoming signal. For example, we might 
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wish to study what happens when a sinusoidal sound wave at some frequency w hits a region 
with different material properties, in which case some of the energy will be reflected (see 
Section 9.6). Suppose the material properties vary only in a region a\ < x < b\ and we 
wish to compute on a larger region a < x < b. Then we want to impose the boundary 
condition 



w 2 (a, t) — sin(<wt) (7.14) 

as the incoming signal, together with nonreflection of any outgoing signal that has been 
generated within the domain. We must now apply some characteristic decomposition in 
the process of applying the boundary procedure in order to impose the correct boundary 
conditions. If we decompose Q\ into g i = IT, 1 r 1 + W 2 r 2 . then the ghost cell value go 
should be set as 



go = Vk/r 1 + sin(a>(t„ + Ax/2 c 0 )) r 2 . 

Alternatively, we could first extrapolate the whole vector g and then reset the w 2 component, 
setting 



go = gi + (sin (a>(t n + Ax/2 c 0 )) - Wj) r 2 . 

See [claw/book/chap7/acouinf low] for an example. 

For the acoustics system with only two equations, the two approaches are essentially 
equivalent in terms of the work required. But if we had a larger system of m equations and 
wanted to impose an incoming wave in only one characteristic family, e.g., 

w J (a, t ) = g 0 (t), 

for some j (with zero-strength signal in other incoming characteristics and nonreflection 
of outgoing waves), then we could set 

go = gi + [goOn + Ax/2X J ) - W(] r J , 

where W( — V Q \ is the eigen-coefficient of r 1 in gi and )J is the corresponding eigen- 
value. 



7.3.3 Solid Walls 

Consider a tube of gas with a solid wall at one end. In this case we expect acoustic waves 
to reflect at this boundary in a particular way - see Section 3.11 for the relation between 
w 1 and w 2 . Rather than working with the characteristic variables, however, in this case it is 
easier to return to the physical boundary condition itself. For a solid wall at x = a this is 



u{a, t) = 0. 



(7.15) 
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The key observation is the following: Suppose we take any data (p°(x), u°(x)) defined for 
all x > a and extend it to x < a by setting 

P°(a - %) = p°(a + %), 

(7.16) 

M°(a — f) = —u°(a + £), 

for i; > 0. Then if we solve the Cauchy problem with this extended data, the solution we 
obtain for x > a will agree exactly with the solution to the half-space problem on x > a 
with a solid wall boundary condition (7.15) at x — a. This follows from symmetry: the 
conditions (7.16) will continue to hold for t > 0 and in particular uia) = —u(a) must be 
zero. See also Exercise 7.2. 

This suggests the following formulas for ghost-cell values in each time step: 



forgo: Po = Pu u 0 = -u i, (7 17) 

for g_i: p-\ = p 2 , U—\ — u 2 . 

This imposes the necessary symmetry at the start of each time step. 

Solid-wall boundary conditions are implemented in the CLAWPACK library routine 
claw/clawpack/ld/lib/bcl . f and invoked by setting mthbc(i) =3, where i = l for 
the left boundary or i = 2 for the right boundary. This assumes that the solid-wall bound- 
ary condition can be set by reflecting all components of g and then negating the second 
component, as in (7.17). This works for the acoustics equations and also for several other 
systems of equations that we will study in this book, including the shallow water equations 
and several forms of the gas dynamics equations. Related boundary conditions can also be 
used for elastic waves in solids with fixed or free boundaries; see Section 22.4. 



7.3.4 Oscillating Walls 

Now suppose that the solid wall at x — a is oscillating with some very small amplitude, 
generating an acoustic wave in the gas. This is a common situation in acoustics: small- 
scale molecular vibrations of solid objects give rise to many familiar sounds. For very 
small-amplitude motions we can still use the linear acoustics equations on the fixed domain 
a < x < b but with the boundary condition 

u(a,t)—U(t ) (7.1 8) 

to simulate the vibrating wall. For a pure-tone oscillation we might take 

U(t) — e sin(&)f), (7.19) 

for example. We can implement this by setting the following ghost-cell values: 

forgo: Ro = Fi, w 0 = 2U(t„) - u\, (7 20) 

forg-i: p-i = p 2 , m_i = 2U(t„) - u 2 . 

These reduce to (7.17) if U(t) = 0. The rationale for this set of boundary conditions is 
explored in Exercise 7.2. 
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Exercises 

7.1. If we use the ghost-cell value Q" } from (7.9) in the Lax-Wendroff method, what flux 
F"j 2 will be computed, and how does it compare with (7.6)? Note that if the Courant 
number is near 1, then A x/u ~ A t. 

7.2. (a) For acoustics with a solid-wall boundary, we set the ghost-cell values (7.17) 

and then solve a Riemann problem at x 1/2 = a with data 




Show that the solution to this Riemann problem has an intermediate state q* 
with n* = 0 along the wall, another reason why this is the sensible boundary 
condition to impose. 

(b) Give a similar interpretation for the oscillating-wall boundary conditions (7.20). 

7.3. The directory [claw/book/chap7/standing] models a standing wave. The acous- 
tics equations are solved with p 0 — Kq — 1 in a closed tube of length 1 with with 
initial data p(x) — cos(2ttx) and u(x) — 0. Solid-wall boundary conditions are 
used at each end. Modify the input data in clawlez . data to instead use zero-order 
extrapolation at the right boundary x — 1 . Explain the resulting solution using the 
theory of Section 3.11. 




8 

Convergence, Accuracy, and Stability 



Whenever we use a numerical method to solve a differential equation, we should be con- 
cerned about the accuracy and convergence properties of the method. In practice we must 
apply the method on some particular discrete grid with a finite number of points, and we 
wish to ensure that the numerical solution obtained is a sufficiently good approximation 
to the true solution. For real problems we generally do not have the true solution to com- 
pare against, and we must rely on some combination of the following techniques to gain 
confidence in our numerical results: 

• Validation on test problems. The method (and particular implementation) should be tested 
on simpler problems for which the true solution is known, or on problems for which a 
highly accurate comparison solution can be computed by other means. In some cases 
experimental results may also be available for comparison. 

• Theoretical analysis of convergence and accuracy. Ideally one would like to prove that 
the method being used converges to the correct solution as the grid is refined, and also 
obtain reasonable error estimates for the numerical error that will be observed on any 
particular finite grid. 

In this chapter we concentrate on the theoretical analysis. Here we consider only the Cauchy 
problem on the unbounded spatial domain, since the introduction of boundary conditions 
leads to a whole new set of difficulties in analyzing the methods. We will generally assume 
that the initial data has compact support, meaning that it is nonzero only over some bounded 
region. Then the solution to a hyperbolic problem (which has finite propagation speeds) will 
have compact support for all time, and so the integrals over the whole real line that appear 
below really reduce to finite intervals and we don’t need to worry about issues concerning 
the behavior at infinity. 



8.1 Convergence 

In order to talk about accuracy or convergence, we first need a way to quantify the error. We 
are trying to approximate a function of space and time, and there are many possible ways to 
measure the magnitude of the error. In one space dimension we have an approximation Q" 
at each point on space-time grid, or in each grid cell when using a finite volume method. 
For comparison we will let q " represent the exact value we are hoping to approximate well. 
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For a finite difference method we would probably choose the pointwise value 

q- = q(x„ t n ), ( 8 . 1 ) 

while for a finite volume method we might instead want to compare Q” with 

J /•-'■/+ 1/2 

q'i = — / q(x, t„)dx. ( 8 . 2 ) 

If the function q(x, t ) is sufficiently smooth, then the pointwise value (8.1) evaluated at 
the cell center x,- agrees with the cell average (8.2) to 0(Ax 2 ), and so for the methods 
considered in this book (which are generally at most second-order accurate), comparison 
with the pointwise value can be used even for finite volume methods and is often simpler. 

To discuss convergence we must first pick some finite time T over which we wish to 
compute. We expect errors generally to grow with time, and so it would be unreasonable to 
expect that any finite grid would be capable of yielding good solutions at arbitrarily large 
times. Note that as we refine the grid, the number of time steps to reach time T will grow 
like T /At and go to infinity (in the limit that must be considered in convergence theory), 
and so even in this case we must deal with an unbounded number of time steps. We will 
use N to indicate the time level corresponding to time T = N At. The global error at this 
time will be denoted by 

E n =Q n -q N , 

and we wish to obtain bounds on this grid function as the grid is refined. 

To simplify notation we will generally assume that At and Ax are related in a fixed 
manner as we refine the grid. For hyperbolic problems it is reasonable to assume that the 
ratio At / Ax is fixed, for example. Then we can speak of letting At — > 0 to refine the grid, 
and speak of convergence with order s if the errors vanish like 0( At s ) or as CH Ax' ), which 
are the same thing. 



8.1.1 Choice of Norms 

To quantify the error, we must choose some norm in which to measure the error at a fixed 
time. The standard set of norms most commonly used are the p-norms 



= Ax £ \Ej\ 



i /p 



(8.3) 



These are discrete analogues of the function-space norms 

( roo \ i Ip 

J \E{x)\ p dxj . (8.4) 

Note that the factor Ax in (8.3) is very important to give the correct scaling and order of 
accuracy as the grid is refined. 

In particular, the 1-norm (with p= 1) is commonly used for conservation laws, since 
integrals of the solution itself are of particular importance. The 2-norm is often used for 
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linear problems because of the utility of Fourier analysis in this case (the classical von 
Neumann analysis of linear finite difference methods; see Section 8.3.3). We will use ||-|| 
without any subscript when we don’t wish to specify a particular norm. Note that for a 
system of m equations, E e R'" and the absolute value in (8.3) and (8.4) represents some 
vector norm on R m . 

We say that the method is convergent at time T in the norm ||-|| if 

lim 11^11=0. 

At— >0 

NAt=T 

The method is said to be accurate of order s if 

\\E N \\ =(D(At s ) as Ar — »■ 0. (8.5) 

Ideally we might hope to have pointwise convergence as the grid is refined. This amounts 
to using the max norm (or oo-norm) to measure the error: 

II E || 00 = max | Ei\. (8.6) 

—oo<i <00 

This is the limiting case p oo of (8.3). 

If the solution q(x, t) is smooth, then it may be reasonable to expect pointwise conver- 
gence. For problems with discontinuous solutions, on the other hand, there will typically 
always be some smearing at one or more grid points in the neighborhood of the disconti- 
nuity. In this case we cannot expect convergence in the max norm, no matter how good the 
method is. In such cases we generally don’t care about pointwise convergence, however. 
Convergence in the 1-norm is more relevant physically, and we may still hope to obtain 
this. In general, the rate of convergence observed can depend on what norm is being used, 
and it is important to use an appropriate norm when measuring convergence. Differences 
are typically greatest between the max norm and other choices (see Section 8.5 for another 
example). Except in fairly rare cases, the 1-norm and 2-norm will give similar results, and 
the choice of norm may depend mostly on which yields an easier mathematical analysis, 
e.g., the 1-norm for conservation laws and the 2-norm for linear equations. 



8.2 One-Step and Local Truncation Errors 

It is generally impossible to obtain a simple closed-form expression for the global error 
after hundreds or thousands of time steps. Instead of trying to obtain the error directly, 
the approach that is widely used in studying numerical methods for differential equations 
consists of a two-pronged attack on the problem: 

• Study the error introduced in a single time step, showing that the method is consistent 
with the differential equation and introduces a small error in any one step. 

• Show that the method is stable, so that these local errors do not grow catastrophically 
and hence a bound on the global error can be obtained in terms of these local errors. 

If we can get a bound on the local error in an appropriate sense, then stability can be used 
to convert this into a bound on the global error that can be used to prove convergence. 
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Moreover we can generally determine the rate of convergence and perhaps even obtain 
reasonable error bounds. The fundamental theorem of numerical methods for differential 
equations can then be summarized briefly as 

consistency + stability •<=>■ convergence. (8.7) 

This theorem appears in various forms in different contexts, e.g., the Lax equivalence 
theorem for linear PDEs (Section 8.3.2) or Dahlquist’s equivalence theorem for ODEs. The 
exact form of “stability” needed depends on the type of equation and method. In this section 
we will study the local error, and in Section 8.3 we turn to the question of stability. 

A general explicit numerical method can be written as 

Q n+1 =Af(Q n ), 

where J\f(-) represents the numerical operator mapping the approximate solution at one time 
step to the approximate solution at the next. The one-step error is defined by applying the 
numerical operator to the true solution (restricted to the grid) at some time and comparing 
this with the true solution at the next time: 

one-step error = A f(q n ) — q n+l . (8.8) 

Here q n and q n+1 represent the true solution restricted to the grid by (8.1) or (8.2). This 
gives an indication of how much error is introduced in a single time step by the numerical 
method. The local truncation error is defined by dividing this by At: 

r n = ~^mq n ) - q n+l ]. (8.9) 

As we will see in Section 8.3, the local truncation error typically gives an indication of 
the magnitude of the global error, and particularly the order of accuracy, in cases when the 
method is stable. If the local truncation error is 0( Ax s ) as ,v — »■ 0, then we expect the global 
error to have this same behavior. 

We say that the method is consistent with the differential equation if the local truncation 
error vanishes as At -* 0 for all smooth functions q(x, t) satisfying the differential equation. 
In this case we expect the method to be convergent, provided it is stable. 

The local truncation error is relatively easy to investigate, and for smooth solutions can 
be well approximated by simple Taylor series expansions. This is illustrated very briefly in 
the next example. 

Example 8.1. Consider the first-order upwind method for the advection equation with 
u > 0, 

e; ,+l = q" - ^ u{q; - qu)- 

Applying this method to the true solution gives the local truncation error 

= t„) - ^ u[q{Xi , t„) - q(Xi_i, t n )\ - q(x t , f„+i)). 



r 



(8.10) 




8.3 Stability Theory 



143 



Wenow expand g(x;_i, f„) and g(x,-, t n+ \) in Taylor series about (xj, f„)and cancel common 
terms to obtain 

r" = ~[q,(Xi, t„) + uq x (xj , t„)] + ^ Ax uq xx {x t , t „) - ^ At q„(Xi, t n ) H . (8.1 1) 

The first term in this expression is identically zero, because we assume that q is an exact 
solution to the advection equation and hence q, + uq x = 0, so we find that 

1 _ 1 , 

Upwind: r' ! = -u Ax q xx (x n t„) - - At q„(x h t„) + 0(At ) 

= ^ u Ax (1 - v)q xx (xi, t„) + 0(At 2 ), (8.12) 

where 



v = u At / Ax (8.13) 

is the Courant number. The truncation error is dominated by an 0( Ax) term, and so the 
method is first-order accurate. 

Similarly, one can compute the local truncation errors for other methods, e.g., 

1 l A x 2 \ 

Lax-Friedrichs: r” = - ( — u 2 At J q xx (Xi , t n ) + 0(At 2 ) (8.14) 

= Ax (1/v - v)q xx (x,, t„) + OiAt 2 ), (8.15) 

Lax-Wendroff: r" = — m(Ax) 2 ( 1 — v 2 )q xxx (xj , t n ) + 0(At 3 ). (8.16) 

6 

Note that the Lax-Wendroff method is second-order accurate and the dominant term 
in the truncation error depends on the third derivative of q , whereas the upwind and 
Lax-Friedrichs methods are both first-order accurate with the dominant error term de- 
pending on q xx . The relation between these errors and the diffusive or dispersive nature of 
the methods is discussed in Section 8.6. 



8.3 Stability Theory 

In this section we review the basic ideas of stability theory and the derivation of global 
error bounds from information on the local truncation error. The form of stability bounds 
discussed here are particularly useful in analyzing linear methods. For nonlinear methods 
they may be hard to apply, and in Section 12.12 we discuss a different approach to stability 
theory for such methods. 

The essential requirements and importance of stability can be easily seen from the fol- 
lowing attempt to bound the global error using a recurrence relation. In time step n suppose 
we have an approximation Q" with error E " , so that 



Q' 1 = q n + E". 
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We apply the numerical method to obtain Q" +l : 

Q n+1 = Af(Q n ) = M(q n + E"), 

and the global error is now 

E n+ 1 = Q" +1 - q n+1 

= A f(q n + E n )~ q n+1 

= U{q n + £") - Af(q") + A/V) - q n+l 

= [A f{q n + E") - Af(q")] + At r". (8.17) 

By introducing A f (q n ) we have written the new global error as the sum of two terms: 

• J\f(q" + E") — Af(q n ), which measures the effect of the numerical method on the previous 
global error E", 

• At t", the new one-step error introduced in this time step. 

The study of the local truncation error allows us to bound the new one-step error. Stability 
theory is required to bound the other term, AT {q n + E") — A f(q n ). 



8.3.1 Contractive Operators 

The numerical solution operator A/"(-) is called contractive in some norm ||-|| if 

\\Af(P) - Af(Q)\\ <\\P - Q\\ (8.18) 

for any two grid functions P and Q. If the method is contractive, then it is stable in this norm 
and we can obtain a bound on the global error from (8. 17) very simply using P — q n + E" 
and Q — q": 

||£' ,+1 || < HACC^" + £•”) — A/'(^”)|| + Ar ||r' ! || 

< || £"11 + At ||t"||. (8.19) 

Applying this recursively gives 

N - 1 

\\E n \\ < ||£°|| + At Ill'll- 

n= 1 

Suppose the local truncation error is bounded by 

lit II = max ||r"||. 

0 <n<N 

Then we have 

|| £^11 < ||£°|| + N At || t || 



(for N At = T). 



(8.20) 
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The term ||£°|| measures the error in the initial data on the grid, and we require that 
|| £°|| — > 0 as At -> 0 in order to be solving the correct initial-value problem. If the method 
is consistent, then also ||r|| -> 0 as A f — > 0 and we have proved convergence. Moreover, 
if || r || = 0(At s ), then the global error will have this same behavior as At -> 0 (provided 
the initial data is sufficiently accurate), and the method has global order of accuracy s. 

Actually a somewhat weaker requirement on the operator A f is sufficient for stability. 
Rather than the contractive property (8.18), it is sufficient to have 

\\W)-MQ)\\ < (l+aAf)||P- 0|| (8.21) 

for all P and Q , where a is some constant independent of At as At — > 0. (Recall that 
the one-step operator J\f depends on At even though we haven’t explicitly included this 
dependence in the notation.) If (8.21) holds then the above proof still goes through with a 
slight modification. We now have 

||£' !+1 || < (1 +a Ar)||£' ! || + At || t || , 



|| £*11 < (1 + a At) A '||£°|| + At ^(1 + a Ar) A '- 1 -"||r || 

n= 1 

< e“ r (||£°|| + r||r||) (for N At — T). (8.22) 

In this case the error may grow exponentially in time, but the key fact is that this growth is 
bounded independently of the time step At. For fixed T we have a bound that goes to zero 
with At. This depends on the fact that any growth in error resulting from the operator A/As 
at most order O(At) in one time step, which is what (8.21) guarantees. 



8.3.2 Lax-Richtmyer Stability for Linear Methods 

If the operator A/"(-) is a linear operator, then N (q n + £") = A f{q n ) + Af(E n ), and so 
Af(q" + £") — J\f(q") reduces to simply A /"(£")• In this case, the condition (8.21) reduces 
simply to requiring that 



||A/’(£")|| < (1 + a At)||£"|| (8.23) 

for any grid function E " , which is generally simpler to check. This can also be expressed 
as a bound on the norm of the linear operator J\f , 

HA/'H < 1 +a At. (8.24) 

An even looser version of this stability requirement can be formulated in the linear case. 
We really only need that, for each time T, there is a constant C such that 

|| A/ - " || < C (8.25) 

for all n < N — T/ At, i.e., the nth power of the operator J\f is uniformly bounded up to this 
time, for then all the terms in (8.22) are uniformly bounded. For linear methods, this form 
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of stability is generally referred to as Lax-Richtmyer stability. The result (8.7) is called the 
Lax equivalence theorem in this context. See [369] for a rigorous proof. Note that if (8.24) 
holds, then we can take C = e aT in (8.25). 

Classical methods such as the first-order upwind or the Lax-Wendroff method for the 
linear advection equation are all linear methods, and this form of the stability condition 
can be used. (See Section 8.3.4 for an example.) The high-resolution methods developed in 
Chapter 6 are not linear methods, however, since the limiter function introduces nonlinearity. 
Proving stability of these methods is more subtle and is discussed briefly in Section 8.3.5 
and Chapter 15. 



8.3.3 2-Norm Stability and von Neumann Analysis 

For linear difference equations, stability analysis is often particularly easy in the 2-norm, 
since Fourier analysis can then be used to simplify the problem. This is the basis of von 
Neumann stability analysis, which is described more completely in many books on finite 
difference methods for partial differential equations (e.g., [333] or [427]). 

Let Q" (— oo < / < oo ) represent an arbitrary grid function for the Cauchy problem. In 
this section we use I for the grid index (x/ — I Ax) so that i — -f—l can be used in the 
complex exponentials below. We suppose that Q " has finite norm, so that it can be expressed 
as a Fourier series 



Q 



« = _L [ 

1 V2n J — c 






(8.26) 



Applying a linear finite difference method to Q'j and manipulating the exponentials typically 
gives an expression of the form 



Q 



"+i = J_ [ 
\p2jt J- 



Q^)g^,Ax,At)e^ IAx d$, 



(8.27) 



where g ( £, Ax, At) is called the amplification factor for wave number £, since 

Q n+ \H) = g(£, Ax, At) g'm (8.28) 

The 2-norm is now convenient because of Parseval’s relation, which states that 

iiei2 = iie"ii2, (8.29) 

where 



\Q"h= Ux J |G';| 2 '| and \\Q n \\ 2 =(f_ 



Q n m 2 dH 



1/2 



To show that the 2-norm of Q n remains bounded it suffices to show that the 2-norm of Q" 
does. But whereas all elements of Q" (as I varies) are coupled together via the difference 
equations, each element of Q n (as § varies) satisfies an equation (8.28) that is decoupled 
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from all other wave numbers. (The Fourier transform diagonalizes the linear difference 
operator.) So it suffices to consider an arbitrary single wave number £ and data of the form 

Q n I= e ifIAx . (8.30) 

From this we can compute g($, Ax, At). Then requiring that |g(§, Ax, Af)| < 1 for all £ 
gives a sufficient condition for stability. In fact it suffices to have | g(£ , Ax , At )| < 1 + a At 
for some constant a independent of § . 

Example 8.2. Consider the upwind method (4.25) for the advection equation q, + uq x — 0 
with ii > 0. Again use v = u At / Ax as shorthand for the Courant number, and write the 
upwind method (4.25) as 



QT 1 = Q" - HQ" - G7_ t) 

= (1 ~v)Q n I + vQ n I _ 1 . (8.31) 

We will use von Neumann analysis to demonstrate that this method is stable in the 2-norm 
provided that 



0 < v < 1 (8.32) 

is satisfied, which agrees exactly with the CFL condition for this method (see Section 4.4). 
Using the data (8.30) yields 

Q'j +1 = (1 - v)e iHIAx + ve m ~ l)Ax 
= [(1 - v)+ ve~^ Ax ]e^ IAx 

= g(S, A.r, At) Q'j (8.33) 

with 



g($, Ax, At) = (l-v)+ ve~^ Ax . (8.34) 

As i; varies, gO). Ax. At) lies on a circle of radius v in the complex plane, centered on the 
real axis at 1 — v. This circle lies entirely inside the unit circle (i.e., |g| < 1 for all § ) if and 
only if 0 < v < 1, giving the stability limit (8.32) for the upwind method. 



8.3.4 1-Norm Stability of the Upwind Method 

For conservation laws the 1-norm is often used, particularly for nonlinear problems. We 
will demonstrate that the upwind method (4.25) considered in the previous example is also 
stable in the 1-norm under the time-step restriction (8.32). We revert to the usual notation 
with i as the grid index and write the upwind method (8.31) as 



G” +1 =(1- v)Q1 + vQU, 



(8.35) 
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where v is again the Courant number (8.13). From this we compute 

|| G " +1 || 1 = 

i 

= /sxY J \v-m n i +vQ n i _ l \ 

i 

< AxJ2[(l-v)\Ql\ + v\QU\l (8.36) 

i 

In the final step we have used the triangle inequality and then pulled 1 — v and v outside 
the absolute values, since these are both positive if (8.32) is satisfied. The sum can be split 
up into two separate sums, each of which gives || 2" II i, obtaining 

iie” +1 iii<(i-v)iie"ii 1 + v|ie''ii 1 = ne n iii. 

This proves stability in the 1-norm. Note that this only works if (8.32) is satisfied, since we 
need both 1 — v and v to be positive. 



I Q1 



M+l I 



8.3.5 Total-Variation Stability for Nonlinear Methods 
For a nonlinear numerical method, showing that (8.23) holds is generally not sufficient 
to prove convergence. The stronger contractivity property (8.21) would be sufficient, but 
is generally difficult to obtain. Even for the linear advection equation, the high-resolution 
methods of Chapter 6 are nonlinear (since the limiter function depends on the data), and so 
a different approach to stability must be adopted to prove convergence of these methods. 

The total variation introduced in Section 6.7 turns out to be an effective tool for studying 
stability of nonlinear problems. We make the following definition. 

Definition 8.1. A numerical method is total-variation bounded ( TVB ) if, for any data Q° 
(with TV(£7°) < oo) and time T , there is a constant R > 0 and a value Ato > 0 such that 

T V(Q”) < R (8.37) 



for all n At < T whenever At < Ato. 

This simply requires that we have a uniform bound on the total variation up to time T on 
all grids sufficiently fine (and hence as At — > 0). 

In Section 12.12 we will see how this can be used to prove convergence of the numerical 
method (using a more subtle argument than the approach taken above for linear problems, 
based on the compactness of an appropriate function space). For now we just note that, 
in particular, a method that is TVD (see Section 6.7) is certainly TVB with R = TV( Q°) 
in (8.37) for any T. So the notion of a TVD method, useful in insuring that no spurious 
oscillations are introduced, is also sufficient to prove convergence. In particular the high- 
resolution TVD methods introduced in Chapter 6 are all convergent provided the CFL 
condition is satisfied (since this is required in order to be TVD). 
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Of course a weaker condition than TVD is sufficient for convergence, since we only need 
a uniform bound of the form (8.37). For example, a method that satisfies 

TV(Q' ,+1 ) < (1 + a Af)TV(<2") (8.38) 

for some constant a independent of A t (at least for all At sufficiently small) will also be 
TVB. 



8.4 Accuracy at Extrema 

Examining the results of Figure 6.1 and Figure 6.3 shows that the high-resolution methods 
developed in Chapter 6 give rather poor accuracy at extrema (local maxima or minima in 
q ), even though the solution is smooth. Figure 8.1(a) gives an indication of why this occurs. 
All of the limiters discussed in Section 6.9 will give slopes a = 0 in cells i — 2 and i — 1 
for this data. This is required in order to prove that the method is truly TVD, as any other 
choice will give a reconstructed function q n (x, t„) with TV(§"(-, t„)) > TV(Q n ), allowing 
the possibility that TV(<2" +1 ) > TV( Q" ) with suitable choices of the data and time step. 
If the solution is smooth near the extremum, then the Lax-Wendroff slope should be close 
to zero anyway, so this is perhaps not a bad approximation. However, setting the slope to 
zero will lead to a clipping of the solution, and the extreme value will be diminished by 
0( Ax 2 ) = Of At 2 ) in each time step. After T / At time steps this can lead to a global error 
near extrema that is O(At), reducing the method to first-order accuracy in this region. This 
can be observed in Figure 6.2. Osher and Chakravarthy [351] prove that TVD methods must 
in fact degenerate to first-order accuracy at extremal points. 

Using a better approximation to q x near extrema, as indicated in Figure 8.1(b), would give 
a reconstruction that allows smooth peaks to be better represented over time, since the peaks 
are then reconstructed more accurately from the data. The cost is that the total variation will 
need to increase slightly at times in order to reconstruct such a peak. But as indicated in 
Section 8.3.5, the TVD property is not strictly needed for stability. The challenge is to find 
looser criteria that allow a small increase in the total variation near extremal points while 
still suppressing oscillations where necessary. This goal has led to several suggestions on 





Fig. 8.1. (a) A smooth maximum and cell averages. A TVD slope reconstruction will give clipping 
of the peak, (b) Better accuracy can be obtained with a reconstruction that is not TVD relative to the 
cell averages. 
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other ways to choose the limiter function and criteria other than strictly requiring the TVD 
property. Shu [407] has developed the theory of TVB methods for which convergence can 
still be proved while better accuracy at extremal points may be obtained. The essentially 
nonoscillatory (ENO) methods are another approach to obtaining high-resolution that often 
give better than second-order accuracy in smooth regions, including extrema. These are 
briefly described in Section 10.4.4. 



8.5 Order of Accuracy Isn’t Everything 

The quality of a numerical method is often summarized by a single number, its order of 
accuracy. This is indeed an important consideration, but it can be a mistake to put too much 
emphasis on this one attribute. It is not always true that a method with a higher order of 
accuracy is more accurate on a particular grid or for a particular problem. 

Suppose a method has order of accuracy s. Then we expect the error to behave like 

||£ a '|| = C(Ax)' 5 + higher-order terms (8.39) 

as the grid is refined and Ax -> 0. Here C is some constant that depends on the particular 
solution being computed (and the time T). The magnitude of the constant C is important 
as well as the value of s. Also, note that the "higher-order” terms, which depend on higher 
powers of Ax, are asymptotically negligible as Ax — > 0, but may in fact be larger than the 
“dominant” term C(Ax) s on the grid we wish to use in practice. 

As a specific example, consider the high-resolution TVD methods developed in Chapter 6 
for the scalar advection equation. Because of the nonlinear limiter function, these methods 
are formally not second-order accurate, even when applied to problems with smooth solu- 
tions. The limiter typically leads to a clipping of the solution near extrema, as discussed in 
Section 8.4. 

For discontinuous solutions, as illustrated in Figure 6. 1 and Figure 6.2, these methods have 
clear advantages over the “second-order” Lax-Wendroff or Beam- Warming methods, even 
though for discontinuous solutions none of these methods exhibit second-order convergence. 

But suppose we compare these methods on a problem where the solution is smooth. 
At least in this case one might think the second-order methods should be better than the 
high-resolution methods, which have a lower order of accuracy. This is true on a sufficiently 
fine grid, but may not be at all true on the sort of grids we want to use in practice. 

Consider the wave-packet propagation problem illustrated in Figure 6.3. Here the data is 
smooth and yet the high-resolution method shows a clear advantage over the Lax-Wendroff 
on the grid shown in this figure. Figure 8.2 shows the results of a mesh refinement study 
for this particular example. The true and computed solutions are compared on a sequence 
of grids, and the norm of the error is plotted as a function of Ax. These are shown on a 
log-log scale because from (8.39) we have 

log |£1 ^ log | C | + s log | Ax |, (8.40) 

so that we expect linear behavior in this plot, with a slope given by the order of accuracy 
s. Figure 8.2(a) shows errors in the max norm, while Figure 8.2(b) shows the errors in 
the 1-norm. The Lax-Wendroff method is second-order accurate in both norms; the slope 
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Fig. 8.2. Log-log plot of the error vs. grid size for the Lax-Wendroff and a high-resolution 
method on the wave-packet problem of Figure 6.3: (a) max-norm errors, (b) 1-norm errors. 
[claw/book/chap8/wavepacket] 

of the corresponding curves is about 1.999. Due to the clipping of peaks observed with 
TVD methods, the error with the high-resolution method is dominated by errors near these 
few locations, and so the max norm shows larger errors than the 1-norm, which averages 
over the entire domain. In the max norm the observed order of accuracy is about 1 .22, and 
on sufficiently fine grids the Lax-Wendroff method is superior. However, the crossover 
point for this particular example is at about Ax = 0.00035, meaning about 2800 grid cells 
in the unit interval. This is a much finer grid than one would normally want to use for 
this problem. Certainly for two- or three-dimensional analogues of this problem it would 
be unthinkable to use this many grid points in each direction. On the coarser grids one 
might use in practice, the high-resolution method is superior in spite of its lower order of 
accuracy. 

In the 1-norm the high-resolution method looks even better. In this norm the observed 
order of accuracy is about 1.92, but the error constant C is about 5 times smaller than 
what is observed for the Lax-Wendroff method, so that on all the grids tested the error is 
essentially 5 times smaller. Of course, for very small Ax the Lax-Wendroff method would 
eventually prove superior, but extrapolating from the results seen in Figure 8.2(b) we find 
that the crossover point in the 1-norm is around Ax = 10 33 . 

Later on we will see other examples where it is wise to look beyond order of accuracy in 
comparing different methods. For example, in Chapter 17 we will see that a fractional-step 
method for source terms that is often dismissed as being “only first-order accurate” is in fact 
essentially identical to second-order accurate methods for many practical purposes, and is 
often more efficient to use. 



8.6 Modified Equations 

As discussed in Section 8.2, the local truncation error of a method is determined by seeing 
how well the true solution of the differential equation satisfies the difference equation. Now 
we will study a slightly different approach that can be very illuminating in that it reveals 
much more about the structure and behavior of the numerical solution in addition to the 
order of the error. 
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The idea is to ask the following question: Is there a PDE to which our numerical ap- 
proximation Q 1 is actually the exact solution? Or, less ambitiously, can we at least find an 
equation that is better satisfied by Q" than the original PDE we were attempting to solve? 
If so, then studying the behavior of solutions to this PDE should tell us much about how the 
numerical approximation is behaving. This can be advantageous because it is often easier to 
study the behavior of solutions of differential equations than of finite-difference formulas. 

In fact it is possible to find a PDE that is exactly satisfied by the Q ", by doing Taylor 
series expansions as we do to compute the local truncation error. However, this PDE will 
have an infinite number of terms involving higher and higher powers of At and Ax. By 
truncating this series at some point we will obtain a PDE that is simple enough to study and 
yet gives a good indication of the behavior of the Q" . If the method is accurate to order s, 
then this equation is generally a modification of the original PDE with new terms of order s, 
and is called the modified equation for the method, or sometimes the model equation. Good 
descriptions of the theory and use of modified equations can be found in Hedstrom [193] 
or Warming & Hyett [480], See [61], [114], [126], [170] for some further discussion and 
other applications of this approach. 



8.6.1 The Upwind Method 

The derivation of a modified equation is best illustrated with an example. Consider the 
first-order upwind method for the advection equation q, + uq x — 0 in the case u > 0, 

Qi +1 = Qi- u -£^(Qi-Qi-i)- (8.4D 

The process of deriving the modified equation is very similar to computing the local trun- 
cation error, only now we insert a function v(x, t) into the numerical method instead of the 
true solution q(x, t). Our goal is to determine a differential equation satisfied by v. We view 
the method as a finite difference method acting on grid-point values, and v is supposed to 
be a function that agrees exactly with Q ” at the grid points. So, unlike q (x , r), the function 
v{x, t) satisfies (8.41) exactly: 

u At 

v(x, t + At) = v(x, t) [t>(x, t) — v(x — Ax, f)]. 

Ax 

Expanding these terms in Taylor series about (x, t ) and simplifying gives 



Or + ^ Ar v„ + ^(A t) 2 v„, H ^ + u 0* - ^ Ax v xx + ^(A x) 2 v xxx H ^ =0. 

We can rewrite this as 

1 1 , , 

v t + uv x = -(u Ax v xx - At v tt ) ~ -[u( Ax) v xxx + (AO v ttt ] H . (8.42) 

2 6 

This is the PDE that v satisfies. If we take At / Ax fixed, then the terms on the right-hand 
side are O( At), 0< At 2 ), etc., so that for small At we can truncate this series to get a PDE 
that is quite well satisfied by the Q" . 
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If we drop all the terms on the right-hand side, we just recover the original advection 
equation. Since we have then dropped terms of Of At), we expect that Q" satisfies this 
equation to O(At), as we know to be true, since this upwind method is first-order accurate. 
If we keep the O(At) terms then we get something more interesting: 

1 

v, + uv x = -{u Ax v xx - At v „ ). (8.43) 

This involves second derivatives in both x and t, but we can derive a slightly different 
modified equation with the same accuracy by differentiating (8.43) with respect to t to 
obtain 

1 

v„ = -uv xt + -(« Ax v xx , - At v,„) 
and with respect to x to obtain 

1 

Vtx — tlV xx -f — ( U Ax Vxxx At V[[ X ). 

Combining these gives 

v„ = u 2 v xx + O(At). 



Inserting this in (8.43) gives 

v t + uv x = -{u Ax v xx ~ it 2 At v xx ) + 0(At 2 ). 

Since we have already decided to drop terms of 0( At 2 ), we can drop these terms here also 
to obtain 

1 

v, + uv x — -u Ax (1 — v)v xx , (8.44) 

where v — u At / Ax is the Courant number. This is now a familiar advection-diffusion 
equation. The grid values Q" can be viewed as giving a second-order accurate approx- 
imation to the true solution of this equation (whereas they only give first-order accurate 
approximations to the true solution of the advection equation). 

For higher-order methods this elimination of t-derivatives in terms of x- derivatives can 
also be done, but must be done carefully and is complicated by the need to include higher- 
order terms. Warming and Hyett [480] present a general procedure. 

The fact that the modified equation for the upwind method is an advection-diffusion 
equation tells us a great deal about how the numerical solution behaves. Solutions to the 
advection-diffusion equation translate at the proper speed u but also diffuse and are smeared 
out. This was clearly visible in Figures 6.1 and 6.3, for example. 

Note that the diffusion coefficient in (8.43) vanishes in the special case u At — Ax. In 
this case we already know that the exact solution to the advection equation is recovered by 
the upwind method; see Figure 4.4 and Exercise 4.2. 

Also note that the diffusion coefficient is positive only if 0 < u At / Ax < 1. This is pre- 
cisely the stability limit of the upwind method. If it is violated, then the diffusion coefficient 
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in the modified equation is negative, giving an ill-posed backward heat equation with ex- 
ponentially growing solutions. Hence we see that some information about stability can also 
be extracted from the modified equation. 



8.6.2 Lax-Wendroff Method 

If the same procedure is followed for the Lax-Wendroff method, we find that all O(At) 
terms drop out of the modified equation, as is expected because this method is second-order 
accurate on the advection equation. The modified equation obtained by retaining the 0(At 2 ) 
term and then replacing time derivatives by spatial derivatives is 

v t + uv x = -\u{ Ax) 2 (1 - v 2 )v xxx . (8.45) 

6 

The Lax-Wendroff method produces a third-order accurate solution to this equation. This 
equation has a very different character from (8.43). The v xxx term leads to dispersive 
behavior rather than diffusion. 

This dispersion is very clearly seen in the wave-packet computation of Figure 6.3, where 
the Q" computed with the Lax-Wendroff method clearly travels at the wrong speed. Dis- 
persive wave theory predicts that such a packet should travel at the group velocity, which 
for wavenumber £ in the Lax-Wendroff method is 

c g = u — -u(Ax) 2 (l — v 2 )f 2 . 

See for example [8], [50], [298], [427], [486] for discussions of dispersive equations and 
group velocities. The utility of this concept in the study of numerical methods has been 
stressed by Trefethen, in particular in relation to the stability of boundary conditions. A 
nice summary of some of this theory may be found in Trefethen [458]. 

The computation shown in Figure 6.3 has f = 80, u — 1, A.r = 1 /200, and u At / Ax — 
0.8, giving a group velocity of 0.97 12 rather than the correct advection speed of 1 . At time 
10 this predicts the wave packet will be lagging the correct location by a distance of about 
0.288, which agrees well with what is seen in the figure. 

For data such as that used in Figure 6. 1 , dispersion means that the high-frequency com- 
ponents of data such as the discontinuity will travel substantially more slowly than the 
lower-frequency components, since the group velocity is less than u for all wave numbers 
and falls with increasing f . As a result the numerical result can be expected to develop a 
train of oscillations behind the peak, with the high wave numbers lagging farthest behind 
the correct location. 

If we retain one more term in the modified equation for the Lax-Wendroff method, we 
find that the Q'j are fourth-order accurate solutions to an equation of the form 

v, + uv x = \u{Axf{v 2 - 1 )v xxx - ev xxxx , (8.46) 

o 

where the e in the fourth-order dissipative term is 0( Ax 3 ) and positive when the stability 
bound holds. This higher-order dissipation causes the highest wave numbers to be damped, 
so that there is a limit to the oscillations seen in practice. 
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Note that the dominant new term in the modified equation corresponds to the dominant 
term in the local truncation error for each of these methods. Compare (8.45) with (8.16), 
for example. 



8.6.3 Beam-Warming Method 

The second-order Beam- Warming method (6.7) has a modified equation similar to that of 
the Lax-Wendroff method, 

v t + uv x = \u{ Ax) 2 (2 - 3v + v 2 )v xxx . (8.47) 

6 

In this case the group velocity is greater than u for all wave numbers in the case 0 < v < 1, 
so that the oscillations move ahead of the main hump. This can be observed in Figure 6.1, 
where v = u At/ Ax — 0.8 was used. If 1 < v < 2, then the group velocity is less than u 
and the oscillations will fall behind. 



8.7 Accuracy Near Discontinuities 

In the previous section we derived the modified equation for various numerical methods, a 
PDE that models the behavior of the numerical solution. This equation was derived using 
Taylor series expansion, and hence is based on the assumption of smoothness, but it turns 
out that the modified equation is often a good model even when the true solution of the 
original hyperbolic problem contains discontinuities. This is because the modified equation 
typically contains diffusive terms that cause the discontinuities to be immediately smeared 
out, as also happens with the numerical solution, and so the solution we are studying is 
smooth and the Taylor series expansion is valid. 

Figure 8.3 shows a simple example in which the upwind method has been applied to the 
scalar advection equation q, + uq x — 0 with discontinuous data q(x) having the value 2 
for x < 0 and 0 for x > 0. The parameters u — 1, Ax = 0.05, and At — 0.04 were used 
giving a Courant number v — 0.8. The dashed line is the true solution to this equation, 
q(x, t) — qix — fit). The solid line is the exact solution to the modified equation (8.44). 
This advection-diffusion equation can be solved exactly to yield 

= (848) 

where 

P=^uAx(l- v) (8.49) 

is the diffusion coefficient from (8.44), and the complementary error function erfc is defined 
by 



2 

erfc(x) = — = / 

J x 



e~ r dz. 



(8.50) 
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Time t = 1 Time t = 3 





Fig. 8.3. Dashed line: exact solution to the advection equation. Points: numerical solution obtained 
with the upwind method. Solid line: exact solution to the modified equation (8.44). (a) At time t — 1. 
(b) At time t = 3. [book/chap8/modeqn] 



The numerical solution to the advection equation obtained using the upwind method, marked 
by the symbols in Figure 8.3, is well approximated by the exact solution to the modified 
equation. 

It follows that we can use the modified equation to give us some insight into the expected 
accuracy of the upwind method on this problem. Comparing v(x, 1 ) from (8.48) to the true 
solution q(x, t ) = 2 H(ut — x), it is possible to show that the 1-norm of the difference is 



II <?(-, t) - v(-, r)||i 




(8.51) 



for some constant C i independent of /3 and t. Since /I is given by (8.49), this gives 



lltfO, 0 - v(-, r)||i C 2 V Ax t 



(8.52) 



as A.r -» 0 with At / Ax fixed. This indicates that the 1-norm of the error decays only 
like ( Ax) 1 / 2 even though the method is formally “first-order accurate’’ based on the local 
truncation error, which is valid only for smooth solutions. 

This informal analysis only gives an indication of the accuracy one might expect from a 
first-order method on a problem with a discontinuous solution. More detailed error analysis 
of numerical methods for discontinuous solutions (to nonlinear scalar equations) can be 
found, for example, in [251], [316], [339], [390], [436], [438]. 



Exercises 

8.1. Consider the centered method (4.19) for the scalar advection equation q t +uq x — 0. 
Apply von Neumann analysis to show that this method is unstable in the 2-norm for 
any fixed At / Ax. 
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8.2. Following the proof of Section 8.3.4, show that the upwind method (4.30) is stable 
provided the CFL condition (4.32) is satisfied. 

8.3. Consider the equation 

q, + uq x = aq , q(x , 0) = q(x), 

with solution q(x, t ) = e a, q(x — lit). 

(a) Show that the method 

er 1 = qi - u ^{Q r ; - qu ) + a taQi 

is first-order accurate for this equation by computing the local truncation error. 

(b) Show that this method is Lax-Richtmyer-stable in the 1-norm provided 
| u At/Ax\ < 1, by showing that a bound of the form (8.23) holds. Note that 
when a > 0 the numerical solution is growing exponentially in time (as is the 
true solution) but the method is stable and convergent at any fixed time. 

(c) Show that this method is TVB. Is it TVD? 

8.4. Show that a method Q' ,+l — A7 Q" ) that is contractive in the I -norm ( L 1 -contractive), 
sothat||Af(P)-A/'(0)||i < \\P —Q || i, must also be TVD, so TV(J\f(Q)) < TV(<2). 
Hint: Define the grid function P by P, — Q,-\ . 

8.5. Prove Harten’s Theorem 6.1. Hint: Note that 

Q-+1 ~ Q'! +1 = (i - c< ~ d?)(Q? + i - QI) + d? + i(Q'! + 2 - Q1+ 1) 

+ cu(Q!-QU)- 

Sum | Q'!+l — Q'‘ +] | over i and use the nonnegativity of each coefficient, as in the 
stability proof of Section 8.3.4. 

8.6. Use the method of Section 8.3.4 to show that the method (4.64) is stable in the 1-norm 
for Ax < u At < 2 Ax. 

8.7. View (8.41) as a numerical method for the equation (8.44). Compute the local trun- 
cation error, and verify that it is 0(At 2 ). 

8.8. Derive the modified equation (8.45) for the Lax-Wendroff method. 

8.9. Determine the modified equation for the centered method (4.19), and show that the 
diffusion coefficient is always negative and this equation is hence ill posed. Recall 
that the method (4.19) is unstable for all fixed At / Ax. 




9 

Variable-Coefficient Linear Equations 



In the preceding chapters we have primarily studied linear problems with constant co- 
efficients. Many interesting problems involve spatially-varying coefficients. This chapter is 
devoted to exploring some of the issues that arise, both analytically and numerically, in this 
case. 

There are several distinct forms that a variable-coefficient hyperbolic system might take, 
each of which arises naturally in some contexts. One possibility is that the coefficient matrix 
A multiplying q x varies with x, so the system is 

q, + A(x)q x = 0. (9.1) 

This system is hyperbolic in some domain if A(x) is diagonalizable with real eigenvalues 
at each x in the domain. Such problems still model wave propagation, and the finite volume 
methods developed in previous chapters can be applied fairly directly, in spite of the fact 
that this system (9. 1 ) is not in conservation form and there is no flux function. 

Another form of variable-coefficient problem that often arises is 

q, + ( A{x)q) x = 0, (9.2) 

in which the matrix A(x) appears within the x -derivative. For the constant-coefficient prob- 
lem with A(x) = A, the two forms (9.1) and (9.2) are identical, but with variable coefficients 
they are distinct and have different solutions. The equation (9.2) is a conservation law, with 
the flux function 



f(q,x) = A(x)q. (9.3) 

In this case the flux function depends explicitly on the location x as well as on the value of the 
conserved quantities q. Again this equation is hyperbolic whenever A(x) is diagonalizable 
with real eigenvalues. 

For a given physical problem it may be possible to derive an equation of either form (9.1) 
or (9.2), depending on how the vector q is chosen. For example, in Section 9.1 we will see 
how flow through a pipe can lead to an advection equation of either form, depending on 
how q is defined. 

We can go back and forth between the forms at the expense of introducing source terms. 
By applying the product rule to the x -derivative in (9.2), we could rewrite this equation in 
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the form 



q, + A(x)q x = —A'{x)q, 

which has the form of (9. 1 ) with the addition of a source term. Conversely, we could rewrite 
(9.1) as 



q, + (A{x)q) x = A'(x)q , 

which now has the form of a conservation law with a source term. Normally we wish to 
avoid adding source terms to the equation if they can be avoided by a better choice of 
variables. 

Another form of variable coefficients that often arises naturally is a capacity function 
k(x), as described in the context of heat capacity in Section 2.3, giving systems of the form 

K(x)q, + (A(x)q) x = 0, (9.4) 

for example. Again this could be manipulated into other forms, e.g., 

q, + K~ l (x)A(x)q x = —K~ l (x)A'(x)q, 

but for problems where Kq is the proper conserved quantity it may be preferable to work 
directly with the form (9.4), using the capacity-form differencing algorithms introduced in 
Section 6.16. 



9.1 Advection in a Pipe 

Consider an incompressible fluid flowing through a pipe with variable cross-sectional area 
k(x), and suppose we want to model the advection of some tracer down the length of the 
pipe. We denote the area by k, since we will see that this is a natural capacity function. 
There are several ways we might model this, depending on how we choose our variables, 
leading to different forms of the variable-coefficient advection equation. 

We are assuming here that a one-dimensional formulation is adequate, i.e., that all quanti- 
ties vary only with x (distance along the pipe) and t, and are essentially constant across any 
given cross section. In reality the flow through a pipe with varying diameter cannot be truly 
one-dimensional, since there must be a velocity component in the radial direction in regions 
where the walls converge or diverge. But we will assume this is sufficiently small that it can 
be safely ignored, which is true if the variation in k is sufficiently smooth. The fluid velocity 
is then given by a single axial velocity u(x) that varies only with x. (See Sections 9.4.1 and 
9.4.2 for other contexts where this makes sense even if u is discontinuous.) 

Note that if k(x) is measured in square meters and u(x) in meters per second, say, then 
the product k(x)u(x) has units of cubic meters per second and measures the volume of fluid 
passing any point per unit time. Since the fluid is assumed to be incompressible, this must 
be the same at every point in the pipe, and we will denote the flow rate by U , 



U — k(x)u(x) = constant. 



(9.5) 
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If we know U and the cross-sectional area k(x), then we can compute 

u(x) = U/k(x). (9.6) 



(In Section 9.4 we consider a formulation in which k(x)u(x) need not be constant.) 

Now suppose that we introduce a tracer into the fluid (e.g., some food coloring or chemical 
contaminant into water, with very small concentration so that it does not affect the fluid 
dynamics). We wish to study how a given initial concentration profile advects downstream. 
There are two distinct ways we might choose to measure the concentration, leading to 
different forms of the advection equation. 

One approach would be to measure the concentration in grams per cubic meter (mass per 
unit volume of fluid), which is what we would probably actually measure if we took a small 
sample of fluid from the pipe and determined the concentration. Call this variable q (x , t ). 

However, since we are solving a one-dimensional problem with quantities assumed to vary 
in only the x -direction, another natural possibility would be to measure the concentration in 
units of mass per unit length of the pipe (grams per meter). If we call this variable q(x, t ), 
then 




q(x , t)dx 



(9.7) 



measures the total mass 1 of tracer in the section of pipe from x\ to X 2 - Since this mass can 
only change due to tracer moving past the endpoints, q is the proper conserved quantity for 
this problem. The one-dimensional velocity u(x) measures the rate at which tracer moves 
past a point and the product u(x)q(x, t) is the flux in grams per second. Hence with this 
choice of variables we obtain the conservative advection equation 



q, + (u(x)q) x — 0. (9.8) 

There is a simple relation between q and q, given by 

q{x,t) — K{x)q(x,t), (9.9) 



since multiplying the mass per unit volume by the cross-sectional area gives the mass per 
unit length. Hence the total mass of tracer in [jti, X 2 ], given by (9.7), can be rewritten as 




K(x)q(x, t) dx. 



With this choice of variables we see that the cross-sectional area acts as a capacity function, 
which is quite natural in that this area clearly determines the fluid capacity of the pipe at 
each point. 

Using the relation (9.9) in (9.8) gives an advection equation for q. 



tc{x)q, + (; u(x)K(x)q) x — 0. 



(9.10) 



1 In problems where chemical kinetics is involved, we should measure the “mass'’ in moles rather than grams, 
and the density in moles per meter or moles per cubic meter. 
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But now notice that applying (9.5) allows us to rewrite this as 

K(x)q t + Uq x — 0. (9.11) 

Dividing by k and using (9.6) gives 



q, + u(x)q x = 0 (9.12) 

as another form of the advection equation. 

Comparing the advection equations (9.8) and (9.12) shows that they have the form of 
(9.2) and (9.1), respectively. The velocity u(x) is the same in either case, but depending on 
how we measure the concentration, we obtain either the nonconservative or the conservative 
form of the equation. The form (9. 1 1 ) is a third distinct form, in which the capacity function 
and flow rate appear instead of the velocity. 

The nonconservative form (9.12) of the advection equation is often called the transport 
equation or the color equation. If we think of food coloring in water, then it is q, the mass per 
unit volume, that determines the color of the water. If we follow a parcel of water through 
the pipe, we expect the color to remain the same even as the area of the pipe and velocity of 
the water change. This is easily verified, since q(x,t) is constant along characteristic curves 
(recall the discussion of Section 2.1.1). By contrast the conserved quantity q is not constant 
along characteristic curves. The mass per unit length varies with the cross-sectional area 
even if the color is constant. From (9.8) we obtain q t + u(.x)q x = — u\x)q , so that along 
any characteristic curve X{t) satisfying X'(t ) = u{X{t)) we have 

t) = -u\X(t))q{X(t), t). (9.13) 

dt 



9.2 Finite Volume Methods 

Natural upwind finite difference methods for the equations (9.8) and (9.12) are easy to 
derive and take the following form in the case U > 0: 

Q'T 1 = Q'i - ^[« (*i)G? - K(*.--i)e"-i] (9.14) 



and 



QT 1 = Q1~ ~ Ql-i) ( 9 - 15 ) 

respectively. Variations of each formula are possible in which we use «(x,_ 1 / 2 ) in place of 
u(xi). Either choice gives a first-order accurate approximation. In this section we will see 
how to derive these methods using the finite volume framework of Godunov’s method, in 
a manner that allows extension to high-resolution methods. 

The Riemann problem at the cell interface x,_i /2 now takes the form of an advection 
equation with piecewise constant coefficients as well as piecewise constant initial data. The 
solution depends on what form of the advection equation we are solving. It also depends 
on how we discretize the velocity u(x). Two natural possibilities are: 
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1. Associate a velocity n, with each grid cell. We might define n,- as the pointwise value 
Uj = u(xj ), or we could compute the cell average of n(x), or we could view the pipe as 
having a piecewise constant cross-sectional area with value k ,■ in the / th cell, and then 
take M; = U/Kj. 

2. Associate a velocity m,-_ 1/2 with each cell interface x,_i/ 2 - 

There are certain advantages and disadvantages of each form, and which is chosen may 
depend on the nature of the problem being solved. To begin, we will concentrate on the 
first choice. For some problems, however, it is more natural to use edge velocities n,_i/ 2 , 
and this choice is discussed in Section 9.5. This is true in particular when we extend the 
algorithms to more than one space dimension; see Section 18.2. 



9.3 The Color Equation 

In the case of the color equation (9.12), the Riemann problem models a jump in color from 
Qj-i to Qi at Xi- 1 / 2 . Since color is constant along particle paths, this jump in color will 
simply propagate into cell C, at the velocity m, in this cell, and so in the notation of Chapter 4 
we have a wave W,_ 1/2 (omitting the superscript 1, since there is only one wave) with jump 

Wj_i/2 = Qi — Qi- 1 , 



and speed 



•Si- 1/2 — Ml- 

Again we are assuming positive velocities n, in each cell C, . If t/ < 0, then we would have 
Si- 1/2 = Uj- 1, since the wave would enter cell i — 1. In spite of the fact that the color 
equation is not in conservation form, we can still view Q" as an approximation to the cell 
average of q(x, t) at time t n . In this case Q" is simply the “average color” over this cell, and 
Ax Q'l is not the total mass of any conserved quantity. It is no longer true that the change in 
this cell average is given by a flux difference, but it is true that the change can be computed 
from the wave and speed. Note that (for U > 0) only the wave W;_ 1/2 enters cell 6j, and 
the value of q in this cell is changed by Q , — Qi-\ at each point the wave has reached. The 
wave propagates a distance ^ — 1/2 At over the time step and hence has covered the fraction 
Sj- 1/2 At / Ax of the cell. The cell average is thus modified by 

Qi +1 = Q'l - 

This gives the upwind method (9.15). Also note that this has exactly the form (4.43), 
e; ,+1 = qi - ^- x {A + ^Qi-t,2+A-AQ i+V2 ), 

if we define 

A + &Qi-\/2 = £/— 1/2VV)— 1/2, 

A Ag;_i /2 = 0 . 



(9.16) 




